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P reface 


This instructor’s manual is designed to accompany the textbook, Introduction to 
the Theory of Computation, by Michael Sipser, PWS Publishers, 1997. It con¬ 
tains solutions to almost all of the exercises and problems appearing in Chapters 
1-5 and 7 along with a handful of solutions to problems in other chapters. Most 
of the omitted solutions in the early chapters require figures, and producing these 

required more work that we were able to put into this manual at this point. The 

6 

next edition will be more complete. 

This manual is available only to instructors and only in hardcopy form. Requests 
for copies should be directed to: 606—282—4629, fax 606—647—5020. 

Thanks to Jonathan Feldman for sending in corrections to a draft of this manual. 
An errata site for this instructor’s manual maintained at 


http://www-math.mit.edu/"'sipser/itocsm-errsl. 1.html . 

The email address for correspondence regarding the textbook and the instructor’s 
manual is sipserbooktoath.mit.edu . 
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Chapter 0 


Here is a sketch of the solution. Make two piles, A and B, of nodes; initially 
empty. Then, starting with the entire graph, add each remaining node to A 
if its degree is greater than 1/2 of all remaining nodes and to B otherwise, 
then discard all nodes to which it isn’t (is) connected if it was added to A 

(B). Continue until nothing is left. 


0.12 




Chapter 1 


a. The start state of M\ is q \. 

b. The set of accept states of Mi is {< 72 }- 

c. The start state of M 2 is qi. 

d. The set of accept states of M 2 is { 91 , 94 }- 

On input aabb Mi goes through the state sequence 91 > 92 , 93 , Qu gi¬ 
f' Mi does not accept aabb . 
g. M 2 does accept e. 


1.1 




a. Mi = ({ 91 .92. 93 },{a>b},<5i, 9i, {? 2 »- 

The transition function is 


1.2 






91 92 91 

92 93 93 

93 92 9i 

b. Af 2 = ({ 91 . 92 , 93 . 94 }.{a»h}, 52 . 9 i.{9i.94})- 

The transition function 5 2 is 


a b 

91 9i 92 

92 93 94 

93 92 9i 

94 93 94 


NFA N with only a 
e language as N. Informally, N' 


(Q,E, S,qo,F) be any NFA. We build 

single accept state that accepts the s 
looks exactly like N except it has e-transitions from states correspond¬ 
ing to accept states of N to a new special accept state, 9 ac cept- Finally 
the state 9accepthas no transitions coming out of it. More formally, N' = 
(Q U {gaccept } , E,5',9o, {g&ccept})- Where for any 9 € Q and o € E: 


Let N 


ll 


1. 


' 


^ • ll 


ifo^e or qp. F 

t jmd 9 € F 




5(9, a) 

5 ( 9 , ®) U { 9 &ccept} if O 


O ( 9 , a) 


l 


and S f (^accept»Q-) 


for a € S e . 




Theory of Computation 


a. Let Af' be the DFA M with the accept and non-accept states swapped. We 

will show that Af 7 recognizes the complement of J3, where B is the language 
recognized by M. Suppose M' accepts x. If 

accept state of M '. Because M and M* have swapped accept/non-accept 

states, if we run Af on x, we would end in a non-accept state. Therefore, 

x & B. Similarly, if x is not accepted by Af', then it would be accepted by 

M. So Af' accepts exactly those strings not accepted by Af. Therefore, Af' 
recognizes the complement of B. 

Since B could be any arbitrary regular language and our construction shows 
how to build an automaton to recognize its complement, it follows that the 

w * ’•.* if f - • 

complement of any regular language is also regular. Therefore, the class of 
regular languages is closed under complement. 

b. Consider the NFA in exercise 1.12(a). The string a is accepted by this au¬ 
tomaton. If we swap the accept and reject states, the string a is still accepted. 
This shows that swapping the accept and non-accept states of an N FA doesn’t 
necessarily yield a new NFA recognizing the complement of the original 
The class of languages recognized by NFAs is, however, closed under comple¬ 
ment. This follows from the fact that the class of languages recognized by 

NFAs is precisely the class of languages recognized by DFAs which we know 
is closed under complement from part (a). 


1.10 


e run M 1 on x we end i 


kVi 


I 


kYi 


one. 


Let £ = { 0 , 1 }. 

a. l £*0 

b. £*l£*l£*l£* 

c. £*01011!* 

d. ££0£* 

e. (0U1 £)(££)* 

f. (0U(10)*)*1* 

g. (e U £)(e U £)(e U £)(e U E)(e U £) 

h. £*0£* U 1111£* U 1 Ue 

i. (l£)*(l U e) 

j. 0*(100U 010 U 001)0* 

k. eUO 

l. (1*01*01*)* U 0*10*10* 

m. 0 
n. ££* 


1.13 


1.15 


a. ab,e;ba,aba 

b. ab,abab; e ,aabb 

c. £,aa:ab,aabb 


d. e,aaa;aa.b 


e. aba,aabbaa; e ,abbb 

f. aba,bab;e,ababab 


k] 
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g. b,ab;£,bb 

h. ba,bba;b,e 


In both parts we first add a new start state and a new accept state. Several 

possible, depending on the order states are removed. 

a. Here we remove state 1 then state 2 and we obtain 

b(a U ba*b)* 

b. Here we remove states 1, 2, then 3 and we obtain 

U ((a U b)a*b((b U a(a U b))a*b)*(e U a)) 


1.16 


solutions 




ai 






A\ = { 0 n l n 2 n | n > 0}. Assume that A\ is regular. Let p be the pumping 
length given by the pumping lemma. Choose 8 to be the string 0 P 1 P 2 P . 

Because s is a member of A\ and s has length more than p, the pumping 

where for 


1.17 




lemma guarantees that s can be split into three pieces 

any i > 0 the string xy'z is in A\. 

i) The string y consists only of Os, only of Is or only of 2 s. In these cases 

the string xyyz will not have equal number of Os, Is and 2s, leading to a 

■ 

contradiction. 

ii) The string y consists of more than one kind of symbol. In this case xyyz 

will have the Os, Is or 2 s out of order. Hence it is not a member of Ai, 

which is a contradiction. 


s = xyz, 


Therefore, A\ is not regular. 

b. A 2 = {tutu| w € {a,b}*}. Assume that A 2 is regular. Let p be the pumping 

length given by the pumping lemma. Choose a to be the string a p ba p b. The 

can be split into three pieces, s = xyz, 


pumping le mm a guarantees that a 
where for any t > 0 the string xy'z is in A 2 . 


only. In both cases that y is left or right of 


The string y consists of a s 

the middle b, xyyz would not be in the form of ww. 

is a p b. The condition \xy\ < p would be violated. 

contains the center b and y £ a p b. Therefore, xyyz would 


ii) The string y 

’•W' *■ -4. .1 . ;■ V 


not be in the form of ww. 


contradictions, Ai is not regular. 


Since all cases cause 

{a 2 " | n > 0}. Assume that A 3 is regular. Let p be the^pumping length 

given by the pumping lemma. Choose s to be the string a 2 . The pumping 

can be split into three pieces, s = xyz, where for 


c. A 3 


lemma guarantees that s 


i > 0 the string xy'z is in A 3 . 

The shortest string in A 3 which is longer than a 
the length of s f is double the length of s. The only way that xyyz could be in 

A 3 is by letting y be a 2P . However, that would violate the pumping lemma 
condition that |xy| < p. Therefore, A 3 is not regular. 


any % 


2P+ 1 


a 22 '. Thus 


2 * : 


is 8 = a 




xyz, where x 


0 , y 


0 P 1 P can be pumped. Let s 
0 P “ 2 1 P . The conditions are satisfied because 


The error is that 8 
and z 


1.18 




Theory of Computation 


i) for any i > 0, xy'z = 00‘0 P-2 1 P is in 0*1*. 

ii) |y| = 1 > 0, and 

iii) \xy\ = 2 < p. 


1.19 


• 9i,9i,9i,9i> 000. 

b. 9i)92>92)92) HI. 

c. 9l>91)92)91)92) 0101. 

d* 91)93! 1- 

e. 91)93,92)93)92; 1111. 
f* 91,93,92,91,93,92,9i! 110110. 
g> 9i! «• 




4 


A finite state transducer is a 5-tuple (Q, E, T, <5, g 0 ), where 




i) Q is a finite set called the states, 

ii) Ei 


a finite set called the alphabet, 

iii) r is a finite set called the output alphabet, 

iv) 8 : Q x E 




Q x r is the transition function , 


v ) Qo € Q is the start state . 


Let M — (Q, £,r,<$,<7o) be a finite state transducer , 
string over E, and v 

v if a sequence of states r 0 ,ri 
conditions: 


w n be a 

v n be a string over the P. Then M outputs 

r n exists in Q with the following two 




W1W2 


• • • 


V 1 V 2 


• • • 




• • • 




i) r 0 

ii) 8(r it w i+ i) = (r i+1> v i+ i) for i 


9o 




1 . 






• • • 




• Ti = (Q, qi), where 

0 Q — { 91 , 92 }, 

ii) E = {0,1,2}, 

iii) r = {0,1} 

iv) 8 is described as 


1.21 






v) qi is the start state. 

b. T 2 = (Q,E,r,J,gx), where 

* * 

i) Q — { 91 , 92 , 93 } 

ii) E = {a,b} 

rn) r = {0,1} 

iv) 5 is described 
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A 


(92,1) (93*1) 

(93*1) ( 9 ii°) 

(9i»°) (92, 1 ) 




92 


93 


v) qi is the start state. 


> 0} is regular. Let p be the pumping length 

0 p 10 p G L, and |s| > p. 

can be divided 


a. Assume L = {0 n l m 0 n | m,n 

given by the pumping lemma. The string s 

Thus the pumping lemma implies that s 
0 °,y = 0 b .z 


1.23 


xyz with x = 
p . However, the string 




= 0 C 10 P , where b > 1 and a + 6 + c 
0 a +c 10 p ^ since a + c < p. That contradicts the pumping 








xy~z 




lemma. 


the complement of C = {0 n 1" | n > 0> is regular, then C itself is regular 

have shown in Example 1.38 




b. If L 

using the result of Exercise 1.10a. However, 

that C is non-regular. So L cannot be regular. 

| m =£ n} is regular. Let p be the pumping length given by 

p+p!. The string a = 0 p l p+p! G L 






kVl 


c. Assume L = {0 m 1 

the pumping lemma. Set m 

and Is) > v . Thus the pumping lemma implies that s can be divided as xyz 

0 c l p + pl , where b> 1 and o+6+e = p. However, the 

QO+p!+6+c^p+p! = Qp+pi^p+p! ^ jj % That contradicts 




p and n 




A: 


= 0, z 

4+i 


o°,y 


with x - 

• 4 

string 8 

the pumping lemma. # # 

Alternative solution: The nonregular language {0 n 1" | n > 0} = C D 0 1 . 
Hence C cannot be regular because the regular languages are closed under 






• ' K 


intersection 


palindrome} is regular. Let p be the 

OP 10P € C 


{u;| w € {0,1}* i 

length given by the pumping lemma. The string s 

in part (a). Hence C isn’t regular, so 




lie 


-%• . 


pumping 


> p. Follow the argument 

r ‘ .' 

is its complement. 


;L 1 m! 


regular language A, let M x be the DFA recognizing it. We need 
DFA that recognizes A R . Since any NFA can be converted to an 

N FA M 2 that recognizes A R . 

We keep all the states in Mi and reverse the direction of all the arrows in 
jVfj. We set the accept state of M 2 to be the start state in Mi. Also 
introduce a new state g 0 as the start state for M 2 which goes to every accept 

(•transition. 


For any 
to find a 


1.24 


equivalent DFA, it suffices to find 


>At\ 


III! 




kVl 




A: 


kVi 


state in M\ by 

For every string s accepted by M \, the path that s follows i 

the start state and stops at one of the accept states, say, g e nd- K we 
s R to Af 2 , it starts at q 0 and forks into different paths starting at each of the 

accept states in M\. The computation can reach the accept state in M 2 by 

path as its reversed counterpart in M\ but in the opposite 


F: * el 


M\ starts at 


ifeii 


following the 
direction. 

Similarly, for every string w accepted by Af 2 , there exists a path P starting 
at o 0 go in g through g e „d as its next step and stopping at the unique accept 




AA i il 




Theory of Computation 


state. Converting the NFA M 2 back to its counterpart Mi and reversing the 

input string, we can get from the start state to q eni in M x by following the 
same path in the opposite direction. 

Therefore, M 2 recognizes A R if M x recognizes A. 


1.27 


The idea is that we start by comparing the most significant bit of the two 

rows. If the bit in the top row is bigger, we know that the string is in the 

language. The string does not belong to the language if the bit in the top 
row is smaller. If the bits on both rows are the s 


e, we move on to the next 
most significant bit until a difference is found. We implement this idea with 

a DFA having states qo, 91 , and 92 - State go indicates the result is not yet 

determined. States qi and 92 indicate the top row is known to be larger, or 

smaller, respectively. We start with q 0 . If the top bit in the input string is 

bigger, it goes to q lt the only accept state, and stays there till the end of the 

input string. If the top bit in the input string is smaller, it goes to 92 and 

stays there till the end of the input string. Otherwise, it stays in state q 0 . 


':mii 


1.28 


Assume language E is regular. Use the pumping lemma to a get a pumping 
length p satisfying the conditions of the pumping lemma. Set s = [ J ] p [ 1 ] 

Obviously, s e E and |s| > p. Thus, the pumping lemma implies that the 
string s can be written as xyz with 

b > 1 and a + 6 + c 


m°,y = [?] 6 


[?] C [o] P » where 

p. However, the string s' = xy°z = [°] a+c [J] p £ E, 
since a+c < p. That contradicts the pumping lemma. Thus E is not regular! 


x 




1.29 


For each n > 1, we build a DFA with the n states 90 , 91 , ... ,q n _ 1 to count 
the number of consecutive a’s modulo n read so far. For each character a 
that is input, the counter increments by 1 and jumps to the next state in M. 
It accepts the string if and only if the machine stops at qo. That means the 
length of the string consists of all a’s and its length is a multiple of n. 

More formally, the set of states oi M is Q = {q 0 ,q u ... , 9 n _ 1 }. The state q 0 

is the start state and the only accept state. Define the transition function 
as; 8(q iy a) = 9 ,- where j = i + 1 mod n. 


1.30 


By simulating binary division, we create a DFA M with n states that 


_ , recog- 

rnzes C7 n . Mhasn states which keep track of the n possible remainders of 

the division process. The start state is the only accept state and corresponds 
to remainder 0 . 


The input string is fed into M starting from the most significant bit. For 

each input bit, M doubles the remainder that its current state records, and 

then adds the input bit. Its new state is the sum modulo n. We double 

the remainder because that corresponds to the left shift of the computed 
remainder in the long division algorithm. 
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If an input string ends at the accept state (corresponding to remainder 0), the 
binary number hits no remainder on division by n and is therefore a member 

of C„. 

w 

The formal definition of M is ({go, • • • >3n-i}> {0> l}>^)3o> {3o})- For each 
qi € Q and b £ {0,1}, define 8(qu b) = q, where j 


(2 i + 6) mod n. 


NFA recognizing A, where A is some regular 
(Q',’£,5,q' 0 ,F') recognizing NOPREFIX(A) 


(Q,E,8,q 0 ,F) be 


. Let M 

language. We construct M' 
as follows: 


1.32 






i) Q' = Q 


8(r,a) R£F 

ReF. 


ii) For r € <2' and a € E define S' (r, o) 


3 


iii) To = 3o 

iv) F' = F 

b. Let M — {Q, E, (5, qo, F) be 

language. We construct M 
as follows: 


NFA recognizing A } where A is some regular 
(Q',E,6,qo,F') recognizing NOEXTEND(A) 


►rt II 




i) Q' = Q 


ii) S' 

iii) 3o 

iv) F' 






{q| there is no path from q to an accept state}. 


* . 




on input w. Consider 

T must output 00, and on input 

both cases the first 


Assume to the contrary that some FST T outputs w 

the input strings 00 and 01. On input 
01, T must output 10. That is impossible, because i 
input bit is a 0, but in one case the first output bit is a 0 and in the other 

case the first output is a 1, and the first output bit depends only on the first 

•* • •• 

input bit. Hence no such FST can exist. 




l. 






iiii 




, .. 


is an equivalence relation we show it is reflexive, symmetric, 


To show that 

and transitive. It is reflexive because no string can distinguish x from itself 

l, x for every x. It is symmetric because x is distinguishable 


1.34 


L 


and hence x 

from y whenever y is distinguishable from x. It is transitive because if w =l * 

l y, then for each z, wz £ L iff xz G L and xz £ L iff yz £ L, hence 

wz £ LiSyz £ L, and so w =l y- 


and x 


a. We prove this assertion by contradiction. Let M be a fc-state DFA that 

recognizes L. Suppose for a contradiction that L has index greater than k. 
That means some set X with more than k elements is pairwise distinguishable 

r 

by L. Because M has k states, the pigeonhole principle implies that X 

contains two distinct strings x and y where S(qo,x) = S(qo,y). Here S(qo,x) 
is the state that M is in after starting in the start state qo and reading 


1 . 







* 




ry of Co 




utat 


W 1 




r 


* 


v 


input string x. Then, for any string z € E*, <J(g 0 , 
either both xz and yz 

aren’t distinguishable by L, contradicting our assumption that X 
distinguishable by L. 

*. •* * 

b. Let X = ,a*} be pairwise distinguishable by L. We construct DFA 

M = (Q, E, S, go, F) with k states recognizing L. Let Q = {gi,... ,g*}, and 
define < 5 (g<, a) to be qj where Sj = L SjO (the relation 
1.34). Note that 8j =i s<a for some a,- € X othe 
elements and would be pairwise distinguishable 
our assumption that L has index k. Let F 
state, go, be the g, such that s; 

9 »» {*1 *)=©} = {*| 9 

c: Suppose L is regular and let k be the number of 


xz) = S(q 0 ,yz). Therefore 

x and y 
is pairwise 


IS 




till 






ill! 


> 


4 




« 




‘ 


l 


* 


> 


l is defined in problem 

• • *• 

* .. s • ,» • t * • 

•a would have fc-f 1 
by L and that contradicts 

{®f| *i ^ 

l e. M is constructed so that, for any state 
l s t }. Hence M recognizes L 




A 


I* 










* 


e* 


. 


■t • 




4 . - 


» 




! 


Let the start 


f 


. •/ 




/ 


•• 




\ 




a DFA recognizing 

L. Then from part (a) L has index at most fc. Conversely, if L has index fc, 
then by part (b) it is recognized by a DFA with fc states, and thus is regular. 
To show that the index of £ is the size of the smallest DFA accepting it, 
suppose that L’s index is exactly fc. Then, by part (b), there is a fc-state 
DFA accepting L, That is the smallest such DFA since if it were any smaller, 
then we could show by part (a) that the index of L is less than fc. 


> 


states 


till 


{ 


IT; 


< 


A 






> > 


* 


* 






;• • r 




*• 










A* 






u *' ' :■ • 




/ 


' • . 


; 


i 


v 


i 


* 




Assume to the contrary that ADD is regular. Let p be the 
given by the pumping lemma. Choose 
a member of ADD 




1. 


SR 


* 


pumping length 


T •#/ . • 


* • * 


s to be the string 


1 P * 0 P + 1 P , which 

the pumping 

tisfy 

pumping lemma 
some fc > 1 . Then xy 2 z is the 
is not a member of ADD , violating the pumping 


. *s 














s has length greater than p, 

lemma guarantees that s can be split into three pieces, s 
the conditions of the lemma. By the third condition 
have that \xy\ < p, it follows that y is 1 * for 
string l p+fc *O p +l p , which i 

lemma. Hence ADD isn’t regular. 


IF; 


. 


4 




f 


’ I 


xyz 


•;f;i 






i 


A 


* 


• f 






/ 


* 


\ • 




• i 


* 


* • 












\ 




) 


1.37 


Note: The version of the problem that appears in the first printing is in¬ 
correct. The correct version (which appears in the second and subsequent 
printings) is: 




* 






Show that the language F = {aVc*| i, j, k > 0 and if i 


1 then j = fc} 

satisfies the three conditions of the pumping lemma even though it is 
regular. Explain why this fact does not contradict the pumping lemma. 

Language F satisfies the conditions of the pumping lemma using pumping 
length 2 . If 8 € F is of length 2 or more we show that it can be pumped by 
considering four cases, depending on the number of a’s that s contains. 


not 




i) If 8 is of the form b*c*, let x 

be the rest of s. 

. - ’ •. *# 

ii) If a i 

be the rest of s. 

% 

* _ 

iii) If s is of the form aab*c*, let x 

let z be the rest of a. 


k 


e, y be the first symbol of s, and let z 


< 


t 


> 


\ 4 


4 *■ ' 


of the form ab*c 


let x 




y be the first symbol of s, and let z 


IF; 




f 


t 


r 






* 


£, y be the first two symbols of s, and 
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e, y be the first symbol of s, and let 


iv) If s is of the form aaa*b*c*, let x 

z be the rest of s. 

In each case, the strings xy % z are members of F for every i > 0 . Hence F 
satisfies the conditions of the pumping lemma. However, F is clearly not 

regular, because the nonregular language {ab 
F H ab*c*, and the regular languages are closed under intersection. The 

pumping lemma is not violated because it states only that regular languages 

satisfy the three conditions, and it doesn’t state that nonregular languages 

fail to satisfy the three conditions. 


n > 0} is the sa 


n^n 


e as 


IM! 




The objective of this problem is for the student to pay close attention to the 

ft 

exact formulation of the pumping lemma. 


1. 




000 6 0001* cannot be 


a. The minimum pumping length is 4 . The string 8 

pumped, and any string in 0001* of length 4 or more contains a 1 and hence 

1 , and z is the rest. 


can be pumped by dividing it so that x = 0, y 
b. The minimum pumping length is 1 . The string e cannot be pumped because 

of condition 2 in the pumping lemma and hence 0 is not a pumping length. 

Any string in 0*1* of length 1 or more contains a 0 or a 1 and hence can be 

0 or 1, and z is the rest, so 1 is a 


pumped by dividing it so that x 

pumping length. 


e* V 


pumping length is 1 . The pumping length cannot be 0 , as in 
part (b). Any string in (01)* of length 1 or more contains 01 and hence can 
be pumped by dividing it so that x = c, y = 01, and z is the rest. 

d. The minimum pumping length is 3 . The string 01 cannot be pumped, so 2 

I ® 

is not a pumping length. All strings in the language of length 3 or more can 
be pumped, (note that there aren’t any such string, so the statement is true 

vacuously) so 3 is a pumping length. 

e. The minimum pumping length is 1 . String e cannot be pumped because of 

condition 2 in the pumping lemma and hence 0 is not a pumping length. The 

so 1 is a pumping length, (the 


c. The mini 


liTiTiiliiTi 1111 


language has no strings of length 1 or more 


conditions hold vacuously). 


E*0* ! “ 1 0*. A DFA with k states can recognize A*. In addition to 


Let Ak 

the start state, it has one state for each number of 0s it has read since the 
last 1. After k -1 0s the machine enters a accept state whose transitions 


1. 








self-loops. 

t . - > 

In any DFA with fewer than k states, two of the k strings 1, 10, ... 
must cause the machine to enter the same state, by the pigeon hole principle. 

add to both of these strings enough 0s to cause the longer 

1 0s, the two new strings will still 


fc-i 


10 


But then, if 

of these two strings to have exactly k 

e the machine to enter the same state, but one of these strings is in Ak 

and the other is not. Hence, the machine must fail to produce the correct 
accept/reject response on one of these strings. 




kVi 




L' fv 
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1.40 


Let A — {2 n | n > 0 }. The set B 3 (A) consists of all strings of the form 10* 

which is obviously regular. We show by contradiction that B = B 3 (A) isn’t 

regular. Suppose that it is. Then there exists a pumping length p such that 

for any string s E B with |s| > p, we can break s up into strings x, y, and 

z satisfying the conditions of the lemma. Consider the string s E B that is 

the base 3 representation of 2 1 where we choose l large enough to force 

have length at least p. Then s = xyz satisfying the conditions of the lemma. 

In particular, xyz E £, xy 2 z E J 3 , and xz E B . Now, define the length of y 

to be TTi and define the length of z to be n. Recall that vn is strictly greater 

than 0. Treat strings as base 3 representations of numbers in the following 
equations: 


kVl 


S to 


i) xz 


x 3 n 4 - z . (Call this quantity go) 

ii) xyz = x 3 m+n + y 3 n + z . (Call this quantity gi) 

iii) xyyz 


x 3 2m+n + j,3 m+n + j,3 n + 2 . (Call this quantity q 2 ) 


We now derive a contradiction algebraically. Using equations (i), (ii), and 
(iii), we have 


x 3 2m+n + (y- x) 3 m+n 


ga ~ gi 
gi ~ go 






x 3 m +" + ( y - x )3 




We know from the pumping lemma that xz, xyz, xyyz EB so it follows that 

2 io and q 2 = 2 <3 , for some l 0 ,l 2 > 0. Because 
base 3 representation of 2 l , we have qi 




xyz and s is the 

2 l . Moreover, l 0 < l < l 2 . Thus, 




2 l » - 2 l 


2 l 3-io _ 2 ,- *o 

2‘-/o 

Setting these two expression equal we obtain 

2«-J 0 (2« 


2 i ~ i o(2 i 




ga ~ gi 

gi — go 


1) 




2‘ - 2'® 


2 ‘-io 


1 


1 




1) 




and thus 2‘ ,0 (2 <J 






1 ) = 3 m ( 2 <_i ° - 1). 




2 ‘~‘o 


1 


The left-hand side of this equation is an even number and the right-hand 
side of this equation is an odd number. Hence we arrive at a contradiction. 


1.41 


The language D can be written in another way, namely, 0E*0U 1E*1, which 
is obviously regular. 


1.42 


Given an NFA M recognizing A we construct an NFA N accep ting Ai 
using the following idea. M keeps track two states in N using two “fingers”. 

As it reads each input symbol N uses one finger to simulate M on that 
symbol. At the same time it from 
Simultaneously, M 


accept state on a guessed symbol. 

s the other finger to rim M backwards from an accept 

state on a guessed symbol. N accepts whenever the forward simulation and 
the backward simulation 




lie* 


e in the same state, that i 
fingers re together. At those points we axe sure that N has found a string 


whenever the two 
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where another string of the same length can be appended to yield a member 
of A y precisely the definition of j 4 ^_. 

Formally, we assume for simplicity that the NFA M recognizing A has a sin¬ 
gle accept state as guaranteed by Exercise 1 . 9 . Let M = (Q, E, S } go, ^accept)* 
Construct NFA N = (Q', £,< 5 ',go,F') recognizing the first halves of the 

strings in A as follows: 


i) Q' 


QxQ. 

ii) For g,r € Q define £'((g,r),a) = {(ti,v)| u e 5 (g,a) and r € 6 (v y b) for 

some b € E}. 


(QO > ^accept) • 




• • • \ 

Ui) 9 o 
iv) F' 


{( 9 . 9)1 9 e Q}. 


{ 0 *# 1 *}. Thus, j fl {0*1*} = {0 n l n | n > 0 }. Regular sets are 

closed under intersection, and {0 n l n | n > 0} is not regular, so j is not 
regular. 


Let A 


1.43 


{x € {0,1}*| the (n — l)st symbol from the right in x is 1} for 


Let E 

n > 2 . Constructing an n-state NFA for E n is easy. 


1.44 






Let M be a DFA recognizing E n . We claim that M has at least 2 n 1 states. 
Let S be the set of all strings of length n — 1 . For each y in 5 let q y be 

the state that M is in after starting at the start sfttte and the reading y. If 

' ‘ •’ < ' ■•••' \ .r 

y jz Zy then q y ^ q z , otherwise M fails to recognize the E n for the following 

Say that y and z differ on the ith bit. Then one of the strings yO*" 1 
_1 is in E n and the other one isn’t. But M has the same behavior on 
both, so M fails to recognize E n . Hence, M has at least as many states a S 
has strings, and so M has at least 2 n-1 states. 




re 


n. 


WzK 


and z 0 * 


Chapter 2 


e the derivations (but not the parse trees). 


Here 


2.1 


ii 


a. F => T =► F => a 


E+T =► E+T+T = 4 - T+T+T => F+T+T =4- F+F+T =► F+F+F =>■ 
a+F+F =► a+a+F => a+a+a 


c. E 


((F)) =» ((a)) 


((F)) =» ((T)) 


(F) 


d. F => T = 4 > F 


(F) 


(T) 


a. First, we show that both A and B axe context-free. It’s not hard to see that 

the following grammar recognizes A: 




S 


RT 


a R I e 
bTc I e 


R 


T 
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The following grammar recognizes B : 


S 


TR 


aTb | e 

c R | e 


T 




e context-free languages. Now, observe that A OB 


So, both A and B 

{a n b n c n | n > 0 }. We know from Example 2.20 that this language is not 
context-free. We have found two context free languages whose intersection 
is not context-free. Therefore context-free languages are not closed under 
intersection. 




b. First, the context-free languages are closed under the union operation. Let 

(Vi, E, Ri,S\) and G 2 = (V2> E) R2, S 2 ) be two arbitrary context free 
grammars. We construct a grammar G that recognizes their union. Formally, 

G = ( V, E, R, S) where: 

i) F = V1UV2 

ii) R = R 1 UR 2 U{S-*Si, S -¥ S 2 } 

9 

(Here we 
variable name 

Next, we show that the CFGs are not closed under complementation. Assume, 
for a contradiction, that the CFGs are closed under complementa tion. T hen, 
if G 1 a nd G 2 are context free grammars, it would follow that L(G x) and 
L(G 2 ) are context free. We p revious ly showe d that context-free languages 
e closed under union and so L{G\) U L{G\) is context free. That, by our 

assumption, implies that L(G\) U L{G\) is context free. But by DeMorgan’s 

laws, L(G\) U L(G\) = L(G\) flL(G2). However, if G\ and G 2 are chosen 

in part (a), L(G\ ) U L(G{) isn’t context free. This contradiction shows that 
the context-free languages are not closed under complementation. 


G x 


e that R\ and R 2 are disjoint, otherwise we change the 
to ensure disjointness) 








a. The variables of G are: R , X, 5 , T. The terminals of G 

variable is R . 

0 

b. Three strings in G are: ab,ba, and aab. 

c. Three strings not in G sure: a, b, and e. 

d. False. 

e. True. 

f. False. 

g. True. 

h. True. 

► 

i. False. 

j. True. 

k. True. 

l . False. 

m. L(G) consists of all strings over a and b that 


b. The start 








Eli 


-i 




e not palindromes. 
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2.4 a. S-+R1R1R1R 

R -4 OR | 1R | e 

b. 5 -4 0B0 | 1A1 | e 

B -4 Oil | IB | e 

c. 5 -4 0 | 1 | 005 | 015 | 105 | 115 

d. 5 -4 0 j 050 | 051 | ISO | 151 

e. 5 -4 BIB 

B -4 BB | OBI | 1B0 | 1 | e 

This CFG works because B generates all strings that have at least as many 
Is as Os, and so 5 forces an extra 1. 

f. 5 0S0 I 151 | 0 | 1 | * 


S 


g- s 


e its stack. The PDA 


a. This set is regular, so the PDA doesn’t even need to 

scans the string and uses its finite control to maintain a counter which counts 
up to 3. It keeps track of the number of l’s it has seen using this counter. 
The PDA accepts the moment it sees three ones. 

J 

b. This set is also regular. The PDA scans the string and keep track of the 

e, it accepts 








first and last symbol in its finite control. If they 
otherwise it rejects. 

c. Again, this set is regular. The PDA scans the string and keep track of the 

length (modulo 2 ) using its finite control. If the length is 1 (modulo 2) it 

• • 

accepts, otherwise it rejects. 

d. The PDA scans across the string and push the symbols onto the stack. At 

some point it nondeterministically guesses where the middle is. It looks at 
the middle symbol. If that symbol is a 1 , it rejects. Otherwise, it scans the 
rest of the string, and for each character scanned, it pops one element off 
of its stack. If the stack is empty before it reaches the end of the input 

k •**# • • 

it rejects. If the stack is empty when it finishes reading the input (i.e. it 
correctly guessed the middle) then it accepts. 

A scans across the input. If it sees a 1 and its top stack symbol is a 
0 , it pops the stack. Similarly, if it scans a 0 and its top stack symbol is a 
1 , it pops the stack. In all other cases, it pushes the input symbol onto the 
stack. 


e the 


1 












. The 


the input, if there is a 1 on top of the stack, it accepts. 


rvi 


er it 




1 ! 


Otherwise it rejects. 

f. The PDA begins by scanning across the string and pushing each symbol onto 
its stack. At 


e point it nondeterministically guesses when it has reached 
the middle. It also nondeterministically guesses if string has odd length or 
even length. If it guessed even, then it pushes the current symbol it’s reading 
onto the stack (recall that the PDA has guessed that this symbol is the middle 
symbol). If it guesses that string has odd length, it goes to the next input 
symbol without changing the stack. Now, it scans the rest of the string, and 
it compares each symbol it scans to the symbol on the top of the stack. If 
they are the 

different, it rejects. If the stack is empty before it finishes reading all the 


[•ini 


e, it pops the stack, and continues scanning. If they are 


f;*Ml 
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input, it rejects. If the stack becomes empty just after it reaches the end of 
the input then it accepts. In all other cases, it rejects. 

g. The PDA simply rejects immediately. 


SaSaSbS | SaSbSaS | SbSaSaS | e 

XbX*B \ T\U 
aTb | s Tb | b 

aUb | blU | a 
a I b 


a. 5 




b. S 


T 


U 


X 


c. 5 


TX 


T -> 0T0 | lTi | *X 
X OX | IX | e 

d. S -> UPV 

P -» aPa | bPb | T | e 

T #MT | # 

U MnU | e 

V #MV | e 

M -> aM | bM | e 

Note that we need to allow for the case when i = j, that is, some is a 
palindrome. Also, e is in the language since it’s a palindrome. 


a. The PDA uses its stack to keep a count of the number of a’s minus twice the 

number of b’s. It enters an accepting state whenever this count is 0. In more 
detail, it operates as follows. It reads the next input symbol. If it is 
and the top of the stack is a b, it pops the b. If it is an a and the top of is 
either an a or empty, it pushes the a. If the next input symbol is a b, then 
repeat the following sentence twice. If the top if the stack is an a, pop the 
stack, otherwise push the b. 

b. The PDA scans across the string. It pushes the a’s it reads onto the stack 

until it encounters a b. Then, for each b it reads, it pops one a off the stack. 
If the machine encounters an a after a b, it accepts (by entering an accept 
state and moving its head to the end of the input). If the stack becomes 
empty before the end of the input is reached, of if the end of the input is 
reached without the stack becoming empty, the machine accepts. Otherwise 
it rejects. 

c. The PDA scans across the input string and pushes every symbol it reads 

until it reads a #. If # is never encountered, reject. Then, it skips over part 
of the input, nondeterministically deciding when to stop skipping. Then, 
it compares the input symbols it next read with the symbols it pops off 
the stack. At any disagreement, or if the input finishes while the stack is 
nonempty, this branch of the computation rejects. If the stack becomes 

empty, the machine accepts. 

d. The PDA uses its nondeterminism to guess the values of i and j and uses 

its stack to compare with x^. In more detail, first it nondeterministically 
branches to the procedures in the next two paragraphs. 


2.7 




w ii 
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The machine begins by nondeterministically skipping over a part (possibly 
empty) of the input until it arrives at a symbol which is either at the be¬ 
ginning of the input string or is immediately to the right of a #. Then, 
the machine continues to read input symbols, while pushing them onto the 
stack. During this reading and popping phase, two cases arise. In one case, 
if an input # is read, the machine nondeterministically skips to another # in 
the input and continues reading inputs symbols, now comparing them with 
symbols popped off the stack. If all agree and the stack becomes empty at 

the same time 

However, if a disagreement occurs, or the stack becomes empty before either 
the end of the input of # symbol is reached, or either the end of the input of 
# symbol is reached without the stack becoming empty, reject on this branch 
of the nondeterminism. In the other case, during the reading and popping 
phase, the machine may nondeterministically chose to enter the reading and 

popping phase, 
case. 


either the end of the input of # symbol is reached, accept. 




in that 


just described. Then it continues to operate 






Here is one derivation: 

(sentence) 

(noun-phrase) (verb-phrase) =4> 

(cmplx-noun) (verb-phrase) =$• 

(cmplx-noun) (cmplx-verb) (prep-phrase) =s- 
(article) (noun) (cmplx-verb) (prep-phrase) => 

The boy (verb) (noun-phrase) (prep-phrase) =*• 

The boy (verb) (noun-phrase) (prep) (cmplx-noun) =*• 

The boy touches (noun-phrase) (prep) (cmplx-noun) => 
The boy touches (cmplx-noun) (prep) (cmplx-noun) => 
The boy touches (article) (noun) (prep) (cmplx-noun) =* 
The boy touches the girl with (cmplx-noun) =>• 

The boy touches the girl with (ARTICLE) (noun) => 

The boy touches the girl with the flower 

. • * * • - * • 

•••••«■ • , • • . • •» • • 

• I I . I • • 

Here is another derivation: 

(sentence) =s< 

(noun-phrase) (verb-phrase) 

(cmplx-noun) (verb-phrase) 

(article) (noun) (verb-phrase) => 

The boy (verb-phrase) 

The boy (cmplx-verb) =s> 

The boy (verb) (noun-phrase) => 

The boy touches (noun-phrase) =j> 

The boy touches (cmplx-noun)(prep-phrase) => 

The boy touches (art) (noun) (prep-phrase) =$> 

The boy touches the girl (PREP-PHRASE) =» 

The boy touches the girl (prep) (cmplx-noun) 

The boy touches the girl with (cmplx-noun) =► 




[ 
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The boy touches the girl with (ARTICLE) (noun) => 

The boy touches the girl with the flower 

Each of these derivations corresponds to a different English meaning. In the 
first derivation, the sentence means that the boy used the flower in order to 
touch the girl. In the second derivation, the girl happens to be holding the 
flower when the boy touches her. 


A CFG G that generates A is given as follows: 

G = (V } E, R , S). 

V is {S ,EabyEictCy A} and E is {a,b,c}. The rules are 






E ab C | AEbc 

±E ab b | e 
bJEfeC | e 
Cc\e 
i4a I e 




Eab 


Ebc 


C 


A 


Initially substituting E ab C for S generates any string with an equal number 
of as and bs followed by any number of cs. Initially substituting 25^ for S 
generates any string with an equal number of bs and cs prepended by any 
number of as. 

a 

The grammar is ambiguous. Consider the string e. On the one hand, it can 

•t * * * %• 

be derived by choosing E a bC with each of E a \> and C yielding e. On the other 
hand, € can be derived by choosing AEbc with each of A and Eh c yielding e. 
In general, any string a*b* c* with i 
this grammar. 


be derived ambiguously i 


ih si 




Informal description of a PDA that recognizes A in Exercise 2.9: 




1. Nondeterministically branch to either step 2 or step 6. 

2. Read and push &’s. 

3. Read b’s, while popping a’s. 

4. If b’s finish when stack is empty, skip c’s on input and accept. 

on input. 


• Skip 

• Read and push b’s. 

7. Read c’s, while popping b’s. 

• If c’s finish when stack is empty, accept. 










rvi 


i 


Informal description of a PDA that recognizes the CFG in Exercise 2.1: 

1. Place the marker symbol $ and the start variable E on the 

stack. 

. Repeat the following steps forever. 


2.11 
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3 . If the top of stack is the variable E , pop it and nondetermin- 

istically push either J£+T or T into the stack. 

4 . If the top of stack is the variable T, pop it and nondetermin- 

♦ 

istically push either TxF or F into the stack. 

• % 

5 . If the top of stack is the variable JF, pop it and nondetermin- 

istically push either (E) or a into the stack. 

. If the top of stack is a terminal symbol, read the next symbol 

* • 

from the input and compare it to the terminal in the stack. 
If they match, repeat. If they do not match, reject on this 
branch of the nondeterminism. 

7. If the top of stack is the symbol $, enter the accept state. 

Doing so accepts the input if it has all been read. 




The formal definition of the equivalent PDA is (Q,£,r, S,qi,F), where Here 

(,),a}; and T = {£,T,F} U £; F = {g 2 }- The 

transition function S : Q x £ e x T e 


{91,92}; £ = {+, 


Q 






V(Q x T e ) is given as follows. 


{(92, e)} 

{ ( 9 i, E+T) ,(qi,T)} if q 

{( gi , TxF), ( 91 , F)} if 9 
{( 91 , <■£>), (9i,a)} if 9 

{(9i, e)} 


if 9 


e,y 


9i , x 
91,® 
91 , x 

9i,* 

9i,* 




E 


£,y 


T 


S{q,x,y) 


£,y 


F 


e,y 


if 9 




Informal description of a PDA that recognizes the CFG in Exercise 2.3: 

• ''•••» ' » , * / P* . » 

# 11 # H 4 ‘ • * *• / ‘ # • ^ 

1. Place the marker symbol $ and the start variable R on the 

• * 

stack. 

• **#*•• • 

****•*•* 1 

2. Repeat the following steps forever. 

3. If the top of stack is the variable R, pop it and nondetermin- 

istically push either XRX or S into the stack. 

If the top of stack is the variable 5, pop it and nondetermin- 

istically push either aTb or bTa into the stack. 

# • • 

«»*• • •* • 

5. If the top of stack is the variable T, pop it and nondetermin- 

istically push either XTX , X or e into the stack. 

9 

. If the top of stack is the variable X , pop it and nondeter- 

ministically push either a or b into the stack. 

7. If the top of stack is a terminal symbol, read the next symbol 

from the input and compare it to the terminal symbol in the 
stack. If they match, repeat. If they do not match, reject on 
this branch of the nondeterminism. 

8. If the top of stack is the symbol $, enter the accept state. 

Doing so accepts the input if it has all been read. 

-y 

t 

The formal definition of the equivalent PDA is (Q, £, T, <5, q\, F), where Q 

{ 91 , 92 }; £ = {a,h}; T 


2.12 


4. 


: • r- 


L J 


{g 2 }. The transition 


{R, S,T,X} U E; and F 
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V(Q x r c ) is given as follows. 


function < 5 : Q x E e x T e 


{(<12, e)} 

{(q u XRX),( qi ,S)} 

{(gi,arb),(gi,bTa)} 

{(fll,*TX), (ft, *),(«!,«)} if q 

{(9i,*),(9i,*>)} 

{(9i.*)} 


if 9 


e,y 


q u x 

9i,x 
9i,x 
Qi,x 
qi,x 
9i, x 


L. A 


if 9 


R 


e,y 


if 9 


S 


e,y 


6(9, x,y) 


T 


e,y 


if 9 


X 


e,y 


if 9 




2.13 a. L(G) is the set of strings of Os and #s that either contain exactly 2 #s and 

any number of Os, or contain exactly 1 # and the number of Os to the right 
of the # is twice the number of Os to the left. 

b. Assume L(G) is regular. Let A = L(G) H0*#0*. If L(G) is regular, so is 

A. Let p be the pumping length for A given by the pumping lemma for 
regular languages. Consider the string w = 0 p #0 2p . Because |u;| > p and 
w G A, the pumping lemma that w = xyz such that \xy\ < p, y ^ c and 

£ I 

xy x z £ L(G) Vi > 0 . We investigate all possible ways of cutting w and prove 

■ 

that such a cut cannot exist. 

i) x contains the character #. In this case, y is to the right of #. Pumping it 
down makes the number of Os on the right less than twice of the number 

of Os on the left. 

. • r •. * # . 

ii) y contains the character #. Pumping y down makes the resulting string 

contain no #s. 

iii) z contains the character #. In this case, y is to the left of #. Pumping y 

down makes the number of Os on the right more than twice of the number 
of Os on the left. 

In each case the resulting string doesn’t belong to A . Hence A fails the 
pumping lemma, so it cannot be regular, and neither can L(G). 


The equivalent CFG in Chomsky normal form is given as follows. 


2.14 


AB | CC | BA | BD | BB \ e 
AB j CC j BA j BD j BB 


So 


A 


CC 




C 




AB 


D 


Let L(G 1) and L(G 2) be CFGs where G\ and G2 are defined as follows. 
(Here we assume that that the sets of variables are disjoint.) let Gi = 

(Vi,E,fl!, 5 i) and G 2 = (F 2 ,E,ie 2 , 5 2 ). • 

We construct a ne 

new variable that is neither in V\ nor in V^. 

Let Gi 1 = (Vi U V2 U { 5 '}, E, Ri U iZ 2 U {r 0 }, 5 '), where r 0 is 5 ' 


2.15 


CFG Gy where L(G U ) = L{G X ) U L(G 2 ). Let S' be a 


iVi 


Si I s 3 . 


u 
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We construct CFG G 0 that generates L{G \) oL^Gj) as in G u by changing tq 

in Go into S' 

To construct CFG G. that generates the language L(Gi) m , let S' be 
variable not in Vi and make it the starting variable. Let ro be S' -*• S'Si \ e 

be a new rule in G*. 


SiS 2 . 


a new 


Let A be a regular language generated by regular expression R. If R is one 
of the atomic regular expressions b, for b G E, construct the equivalent CFG 
({5} ? {&}, {S -4 b}, S). If R is the atomic regular expression e, construct the 
equivalent CFG ({5}, {&>, {S -> «}, 5). If R is the atomic regular expressions 

construct the equivalent CFG ({5},{1>},{5 S},S). 

If R is a regular expression composed of smaller regular expressions combined 

e the result of Problem 2.15 to construct an 


2.16 


a 


with a regular operation, 
equivalent CFG out of the CFLs that are equivalent to the smaller expressions. 




. Let C be a context-free language and R be a regular language. Let P be the 
PDA that recognizes G, and D be the DFA that recognizes R. If Q is the set 
of states of P and Q 1 is the set of states of D, we construct a PDA P' that 
recognizes G H R with the Set of states P X Q. P' will do what P does and 
also keep track of the states of D. It accepts a string w if and only if it stops 
at a state q 6 Fp x F D , where F P is the set of accept states of P and F D is 
the set of accept states of D. Since G D R is recognized by P', it is context 


2.17 a 


kVl 


free. 

b. Let R be the regular language a*b*c*. If A were A a CFL then AC\R would 

n > 0} and Example 2.20 


be a CFL by part (a). However, A fl R = {a n b 
proves A fl R is not context-free. Thus A is not a CFL. 


n -n 




> 0}. Let p be the pumping length given by the 

0 P 1 P 0 P 1 P cannot be pumped. Let 


2.18 a. Let A = {0 n l n 0 n l 

pumping lemma. We show that s 






uvxyz . 

If either v or y contain more than one type of alphabet symbol, uv 2 xy 2 z does 
not contain the symbols in the correct order. Hence it cannot be a member of 
A. If both v and y contain (at most) one type of alphabet symbol, uv 2 xy 
contains runs of 0*s and l’s of unequal length. Hence it cannot be a member 
of A . Because s cannot be pumped without violating the pumping lemma 

conditions, A is not context free. 

b. Let B = 

pumping 
pumped. 

Neither v nor y can contain #, otherwise xv 2 wy 2 z contains more than two #s. 
Therefore, if we divide s into three segments by #’s: 0 P , 0 2p , and 0 3p , at least 
one of the segments is not contained within either v or y. Hence, xv 2 wy 2 z is 
not in B because the 1: 2 : 3 length ratio of the segments is not maintained. 








{0 n #0 2n #0 3n | n > 0}. Let p be the pumping length given by the 

0 p #0 2p #0 3p . We show that s = uvxyz cannot be 


lemma. Let 8 
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c. Let C = {tu#x| id is a substring of x, where w,x G {a,b}*}. Let p be the 

pumping length given by the pumping lemma. Let s = a p b p #a p b p . We show 
that the string a = uvxyz cannot be pumped. 

ft 

Neither v nor y can contain #, otherwise uv®xy®z does not contain # and 
therefore is not in C. If both v and y are nonempty and occur on the left- 
hand side of the #, the string uv 2 xy 2 z cannot be in C because it is longer on 
the left-hand side of the #. Similarly, if both strings occur on the right-hand 
side of the #, the string uv°xy°z cannot be in C, because it is again longer on 
the left-hand side of the #. If only one of v and y is nonempty (both c ann ot 
be nonempty), treat them as if both occurred on the same side of the # 
above. 

The only remaining case is where both v and y are nonempty and straddle 

the #. But then v consists of b’s and y consists of a’s because of the third 

pumping lemma condition \vxy\ < p. Hence, uv 2 xy 2 z contains more b’s on 
the left-hand side of the # 

d. Let D = {xi#X2# • * * #£fc| fc > 2, each Xi € {a,b}*, and for some x<=Xj}. 

Let p be the pumping length given by the pumping lemma. Let a = a p b p #a p b p . 

We show that a = uvxyz cannot be pumped. Use the same reasoning as in 
part (c). 




it cannot be a member of C. 






Notes The version of the problem that appears in the first printing is in¬ 
correct. The correct version (which appears in the second and subsequent 
printings) is: Show that the language F = {aVc fc | i, j, k > 0 and if i = 1 
then j = k} satisfies the three conditions of the pumping lemma even though 
it is not regular. Explain why this fact does not contradict the pumping 
lemma. 

Every rule of a Chomsky normal form grammar is of the form 
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A 


BC 


A 




(except possibly for the rule S —> e, where S is the start variable. This rule 
isn’t used in the derivation of w because n > 1.) 

Consider a derivation of w. Each application of a rule of the form A -+ BC 
increases the length of the string by 1. So we have n — 1 steps here. Besides 
that, we need exactly n applications of terminal rules {A —> a) to convert the 
variables into terminals. Therefore, exactly 2n — 1 steps are required. 


Note: The version of the problem that appears in the first printing is in- 

» i , • 

correct. The correct version (which appears in the second and subsequent 

printings) replaces the phrase u more than b steps” with the phrase “at least 
2 b steps”. 

Let G be a CFG in Chomsky normal form that contains b variables. Assume 
G generates a string w using a derivation with at least 2 b steps. Let n be 

the length of w. By the results of Problem 2 . 20 , n > 2!±i > 2*-i. 
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The right-hand side of each rule contains at 


Consider a parse tree of w. 
most two variables, so each node of the parse tree has at most two children. 

Additionally, the length of w is at least 2 6 , so the parse tree of w must have 

height, at least b +1 to generate a string of length at least 2 b . Hence, the tree 

™ntaing a path with at least 6 + 1 variables, and therefore some variable is 

repeated on that path. Using a surgery on trees argument identical to the 

one used in the proof of the CFL pumping lemma, we can now divide w into 

pieces uvxyz where uv'xy'z 6 G for all t > 0. Therefore, L[G) is infinite. 


a. To see that G is ambiguous, note that the string 
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:-l 


11 condition then if condition then a:*i else 


has two different leftmost derivations (and hence parse trees): 

(stmt) 


(if-then) 

if condition then (stmt) 

if condition then (IF-THEN-ELSE) 

if condition then if condition then (stmt) else (STMT) 
if condition then if condition then a:*l else (stmt) 
if condition then if condition then a:=l else a:*l 


and 


(stmt) 


(if-then-else) 

if conditon then (stmt) else (stmt) 
if conditon then (if-THEN) else (stmt) 
if conditon then if condition then (stmt) else (stmt) 
if condition then if condition then a:*l else (stmt) 

if condition then if condition then a:*l else a:=l 


b. The amb iguity in part a) arises because the grammar allows matching 

else both to the nearest and to the farthest then. To avoid this ambigu- 

grammar that only permits the nearest match, by 

else. 




ity we construct a ne 

disabling derivations which introduce an (if-then) before 

variables: (e-stmt) and (e-if-then-else) , which 

work just like their non-(E) counterparts except that they cannot generate 

the dangling (if-then). The rules of the new grammar 
the old grammar except we remove the (if-then-else) rules and add the 

following new rules: 


kVi 




r :lil 


grammar has two ne 


kYJ 


for 


e the 




♦Vi 






(assignment) | (begin-end) | (e-if-then : else) 

if conditon then (e-stmt) else (e-stmt) 

if conditon then (e-STMT) else (stmt) 


(e-stmt) 

(e-if-then-else) 

(if-then-else) 
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L(G) is the set of strings of Os and #s that either contain exactly two #s and 
any number of Os, or contain exactly one # and the number of Os to the right 
of the # is twice the number of Os to the left. First we show that three is 
not a pumping length for this language. The string 0#00 has length at least 
three, and it is in L(G). It cannot be pumped using p = 3, because the only 
way to divide it into uvxyz satisfying the first two conditions of the pumping 

lemma is u = z = e, v = 0, x = #, and y = 00, but that division fails to 
satisfy the third condition. 

Next, we show that 4 is a pumping length for L(G). If w £ L(G) has length 
at least 4 and if it contains two #s, then it contains at least one 0. Therefore, 
by cutting w into uvxyz where either v or y is the string 0, we obtain a way to 

pump w. If w E L(G) has length at least 4 and if it contains a single #, then 
it must be of the form 0*#0 2fc for some k > 1. Hence, by assigning 

v = o, x = #, y = 00, and z 

lemma. 


2.22 


o*-\ 

0 2fc ~ 2 , we satisfy all three conditions of the 




Let F = {a t b J c fc d l | i, j, A:, / > 0 and if i 
free because F n ab*c*d 
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1 then j = k = /}. F is not context 
{ab n c n d n | n > 0}, which is not a CFL by the 
pumping lemma, and the intersection of a CFL and a regular language is a 
CFL (see Problem 2.17). However, F does satisfy the pumping lemma with 
p = 2. (p = 1 works, too, with a bit more effort.) 


If s E F has length 2 or more then: 


i) If a E b*c*d* write a as s 
where u = r, u = p, x = y 

ii) If s E ab*c*d* write s 

v ss a, x = y = c, z = t. 

iii) If s € aaa*b*c*d* write 8 as s = aa t and divide a = uvxyz where u = e 

y = e, z = t. 

In each of the three cases, we may easily check that the division of 8 satisfies 
the conditions of the pumping lemma. 


rgt for g E {6, c, d} and divide 8 
e.z = t. 


uvxyz 


a t and divide s = uvxyz where u = e 














Let G be a CFG generating A . Let p be the pumping length for G given 
by the pumping lemma for CFGs. Let k 


2.24 


p(p - l)(p- 2)--«l. Let 

8 = a fc b*c*. We show that 8 has two distinct parse trees in G. To do so we’ll 
also consider the following two strings in A y 8 % = a*b p c p and 82 = a^l^c* 
with parse trees r\ and T 2 . 


pi 


In T\ } remove all nodes that have only a’s below them. The remaining subtree 
has 2p leaves that are only b’s and c’s. Because 2p > p we know that the 
subtree contains a path with a repeated has variable i?, using the s 
argument that appears in the proof of the pumping lemma. Again by the 
same argument 

that \vxy\ < p. Strings v and y can each only contains one type of symbol; 
otherwise uv 2 xy 2 z is not in A. Furthermore, y contains no a’s, because R 




'SIM 


e R to divide 8 into 8 


uvxyzj and R can be chosen so 


m 
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on a path to a b or c. Hence v must be b’s and y must be c’s and they 

integer 

8 has a parse tree where most 
on with c’s but not with a’s. Repeating this 


wa 




k/l (which must be 


equal length /. Letting d 


must have 

because l < p and k = p!) the string vu d xy d 
b’s have a parent node in co: 
argument with 7*2 we obtain a parse tree for a where most b’s have a parent 
node in common with a’s but not with c’s. Hence a haw two distinct parse 


>11 




llllll 


trees, so G is ambiguous. 


The grammar generates all strings not of the form a fc b fc for k > 0. Thus 
the complement of the langu age g enerated is L(G) = {a fc b* 
following grammar generates L(G): {{5}, {a,b}, {5 —► aSb | e},5}. 


2.25 


k > 0}. The 


Let C = {x#y\ x,y e {0, l}* amd x ^ y}. We construct a PDA P that recog¬ 
nizes C. It operates by guessing corresponding positions on which the strings 

x and y differ 

some symbols, say Is, onto the stack. At some point it nondeterministically 
guesses a position in x and it records the symbol it is currently reading there 
in its finite memory amd skips to the #. Then it pops the stack while reading 
symbols from the input until the stack is empty and checks that the symbol 
it is now currently reading is different from the symbol it had recorded. If 
so, it accepts. 

Here is a more detadled description of P’s algorithm. If something goes wrong, 
for example, popping when the stack is empty, or getting to the end of the 
input prematurely, P rejects on that branch of the computation. 

1. Read next input symbol, amd push 1 onto stack. 

2. Nondeterministically jump to either 1 or 3. 

3. Record the current input symbol in the finite control. For con¬ 
venience call this symbol a. 

4. Read input symbols until # is read. 

5. Read the next symbol, and pop the stack. 

. ■ • .. * **• i ■ • • • 

. If stack is empty, go to 7, otherwise go to 5. 

7. Accept if the current input symbol differs from a, otherwise 

reject . 
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e time as it pushes 


follows. It reads the input at the s 


Tin 


f;T 


r* 


Let D = {xy\ x,y G {0,1}* and |x| = \y\ but x ^ y}. We construct a PDA 
P recognizing D . This PDA guesses corresponding places on which x and y 
differ. Checking that the places correspond is tricky. Doing so relies on the 
observation that the two corresponding places aire n/2 symbols apart, where 

n is the length of the entire input. Hence, by ensuring that the number of 

& 

symbols between the guessed places is equal to the number other symbols, 
the PDA cam check that the guessed places do indeed correspond, 

Here we give a more detailed description of the PDA adgorithm. If something 
goes wrong, for example, popping when the stau:k is empty, or getting to the 
end of the input prematurely, P rejects on that branch of the computation. 


2.27 
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1. Read next input symbol and push 1 onto the stack. 

2. Nondeterministically jump to either 1 or 3. 

3. Record the current input symbol in the finite control. For con¬ 
venience call this symbol a. 

4. Read next input symbol and pop the stack. Repeat until stack 

is empty. 

5. Read next input symbol and push 1 onto the stack. 

6. Nondeterministically jump to either 5 or 7. 

7. Reject if current input symbol differs from a. 

• Read next input symbol and pop the stack. Repeat until stack 
is empty. 

9* Accept if input is empty. 

Alternatively we can give a CFG for this language 


r 


follows. 


! :T 


AB I BA 
XAX | 0 
XBX | 1 


S 


A 




X 


0 1 


Let G = (E, V , T, P, S) be a context free grammar for A. Define r to be the 
length of the longest string of symbols occurring on the right hand side of 
any production in G. We set the pumping length k to be r 2 l v l +1 . Let a be 
any string in A of length at least k and let T be a derivation tree for A with 
the fewest number of nodes. Observe that since s > fc, the depth of T must 
be at least 2\V\ + 1 . Thus, some path p in T has length at least 2\V\ + 1. By 
the pigeonhole principle, some variable V' appears at least three times in p. 
Label the last three occurrences of V f in p as Vi, Vq, and V 3 . Moreover, define 
the strings generated by Vi, V 2 , and V 3 in T as si, a% } and S 3 respectively. 

Now, each of these strings is nested in the previous because they are all being 

* * * • • 

generated on the same path. Suppose then that a\ = Z 1 S 2 P 1 and a 2 = h^r?. 
We now have three cases to consider: 

i) \hh\ > 1 and |rir 2 1 > 1: In this case, there is nothing else to prove since 
we can simply pump in the usual way. 

ii) IZ 1 Z 2 I = 0: Since we defined T to be a minimal tree, neither 7*1 nor 1*2 can 

be the empty string. So, we can now pump r\ and 7*2. 

iii) | 7 *i 7 * 2 | = 0: This case is handled like the previous one. 


2.28 


Chapter 3 


3.1 


«-*9a u ) uu< Zacc«pt 

b. qi 00, ug 2 0, uxg 3 u, ug 5 xu, 45UXU, ugjxu, uxg 2 u, uxug accept 

* 

c. 91OOO, ug 2 00 , UX93O, uxOg.u, ux 0 ug re ; ect 
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d. qi 000000, ug 2 00000, uxg 3 0000u, ux0g 4 000, ux0xg 3 00, ux0x0g 4 0, 

ux0x0xg 3 u, ux0x0g 5 xu, ux0xg 5 0xu, ux0g 5 x0xu, uxg 5 0x0xu, ug 5 x0x0xu, 
gsuxOxOxu, ug 2 x0x0xu, uxg 2 0x0xu, uxxg 3 x0xu, uxxxg 3 0xu, uxxx0g 4 xu, 

uxxx0xg 4 u, uxxxOxLig re jeet 


a. gill,Lig 3 l,ulg 3 u,ulug reject 

b. gxl#l, ug 3 #l, u#g 5 1 , u# 1 g 5 li , u#g 7 lu, ug 7 #iu, g 7 u#lu, ug 9 #lu, u#g n lu, 

ugi 2 #x u , gx 2 U#xu, ugx 3 #xu, u#gx 4 xu, u#xgx 4 u, u#xug accept 

c. gxl##l,ug 3 ##l,u#g 5 #l,u##g re j ect l 

d. gil0#ll,ug 3 0#ll,u0g 3 #ll,u0#g5ll,u0#lg5l,u0#llg5u,u0#lg 7 lu, 

u0#g 7 llu,u0g 7 #llu, ug 7 0#llu,g 7 u0#liu, ug 9 0#llu,Lj0g 9 #llu,u0#giillu, 
u0gx 2 #xlu, ugx 2 0#xlu, gx 2 u0#xlu, ug 13 0#xlu, uxg 8 #xlu, ux#gi 0 xlu, 
UX#xgiolu, UX#X 1 g te ject u 

e. gxl0#10, ug 3 0#10, u0g 3 #10, u0#g 5 10, u0#lg 5 0, u0#10g 5 u, u0#lg 7 0u, 

u0#g 7 10u, u0g 7 #10u, ug 7 0#10u, g 7 u0#10u, ug 9 0#10u, u0g 9 #10u, u0#gul0u, 
u0gx 2 #x0u, ugx 2 0#x0u, gi 2 u0#x0u, ugx 3 0#x0u, uxg 8 #x0u, ux#gxoxOu, 
ux#xgxoOu, ux#gx 2 xxu, uxqi 2 #xxu, ugi 2 x#xxu, g 12 ux#xxu, ugx 3 x#xxu, 
uxgx 3 #xxu, ux#gx 4 xxu, ux#xgx 4 xu, ux#xxgx 4 u, ux#xxuq accept 




If a language L is decidable, it can be decided by a deterministic TM and 
that is automatically a nondeterministic TM. 

If a language L is decided by a nondeterministic TM iV, we construct a 
deterministic TM 2?2 that decides L. TM D 2 uses the same algorithm as in 
the TM D described in the proof of Theorem 3.10, with an additional step 
5 : Reject if all branches of nondeterminism of N axe exhausted. 

We argue that D 2 is a decider for L. If N accepts, D 2 will eventually find 
accepting branch and accept, too. If N rejects, all of its branches halt and 
reject. Therefore, by the theorem on trees given in the exercise, its entire 
computation tree is finite, and D will halt and reject when this entire tree 

has been explored. 
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An enumerator is a 7-tuple (Q, £, T, 6, qo, Sprint > ^accept)> where Q, E, T are all 
finite sets and 

i) Q is the set of states, 

ii) T is the work tape alphabet, 

iii) E is the output tape alphabet, 

iv) 6: Q x T —>Q xT x { L,R} x E c is the transition function, 

v) qo €Q is the start state, 

vi) g P rint G Q is the print state, and 

* • 

vii) gaccept € Q is the reject state, where g pr i nt # ^reject- 
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• 1 


The computation of an enumerator E is defined as in an ordinary TM, except 
for the following points. It has two tapes, a work tape and a print tape, both 
initially blank. At each step, the machine may write a symbol from E on the 
output tape, or nothing, as determined by J. If 6(q,a) = (r, 6,L,c), it 
that in state q , reading a, enumerator E enters state r, writes b on the work 
tape, moves the work tape head left (or right, if L had been R), writes c on 
the output tape, and moves the output tape head to the right ifc ^ e. 

Whenever state q pT int Is entered, the output tape is reset to blank and the 
head returns to the left-hand end. The machine halts when ^accept is entered. 
L(E) = {w € E*| w appears on the work tape if g pr int is entered}. 


itiuRViK 


a. Yes. T is the tape alphabet. A Turing Machine can write any characters in 

r on its tape, and u 6 T according to the definition. 

b. No. E never contains u but T always contains u. So, they cannot be equal. 

c. Yes. If the Turing Machine tries to move its head off the left-hand end of the 

tape, it remains on the same tape cell. 

d. No. Any Turing Machine must contain two distinct states Accept and Reject • 

So, a Turing Machine contains at least two states. 




In Stage 2 of this algorithm: “Run M on Si , 

f 

Si, E would not check any inputs after s». If that were to occur, E might fail 
to enumerate L(M) as required. 


if M loops on a certain input 




3. 


M 


Xfc have infinitely may possible settings. A Turing 


The variables xi 

machine would required infinite time to try them all. But, we require that 
every stage in the Turing machine description be completed in a finite number 
of steps. 


3-7 




• • • 




a. “On input string w: 

1. Scan the tape and mark the first 0 which has not been marked. 

If there is no unmarked 0, go to stage 4. Otherwise, move the 
head back to the front of the tape. 

2 . Scan the tape and mark the first 1 which has not been marked. 

If there is no unmarked 1, reject . 

3. Move the head back to the front of the tape and repeat stage 

1 . 

4. Move the head back to the front of the tape. Scan the tape 

to see if any unmarked Is remain. If there are none, accept . 
Otherwise, reject . 

b. “On input string w: 

1 . Scan the tape and mark the first 0 which has not been marked. 

# 

If there is no unmarked 0, go to stage 4. 
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2 . Move on and mark the next unmarked 0. If there is not any on 

the tape, reject . Otherwise, move the head back to the front 
of the tape. 

. Scan the tape and mark the first 1 which has not been marked. 
If there is no unmarked 1, reject . 

4. Move the head back to the front of the tape and repeat stage 




1 . 


5 . Move the head back to the front of the tape. Scan the tape 

to see if there are any unmarked Is. If there is not, accept . 
Otherwise, reject . 

c. “On input string w: 

1 . Scan the tape and mark the first 0 which has not been marked. 

If there is no unmarked 0, go to stage 4. 

2 . Move on and mark the next unmarked 0. If there is not any on 

the tape, accept . Otherwise, move the head back to the front 
of the tape. 

• Scan the tape and mark the first 1 which has not been marked. 
If there is no unmarked 1, accept . 

4. Move the head back to the front of the tape and repeat stage 




k- 


1 . 


5 . Move the head back to the front of the tape. Scan the tape 

to see if there are any unmarked Is. If there is not, reject . 
Otherwise, accept . 




a. By Example 2.20, no PDA recognizes B = {a n b n c n | n > 0}. The following 

2 -PDA recognizes B . Push all the a’s that appear in the front of the input 
tape to stack 1. Then push all the b’s that follow the a’s in the input stream 
into stack 2. If it sees any a’s at this stage, reject. When it sees the first 
c, pop stack 1 and stack 2 at the same time. After this, reject the input if 
it sees any a’s or b’s in the input stream. Pop stack 1 and stack 2 for each 

• .p 

% i * • p m • 

input character c it reads. If the input ends when both stacks are empty, 
accept . Otherwise, reject . 

.r • >•*. * 

• • 

b. We show that a 2-PDA can simulate a TM. Furthermore, a 3-tape NTM can 

ordinary deterministic, 1-tape TM can simulate 






simulate a 3-PDA, and 
a 3-tape NTM. Therefore a 2-PDA can simulate a 3-PDA, and so they 






►STS I 


equivalent in power. 

The TM by 2-PDA simulation is done as follows. We split the tape of a TM 
into two stacks. Stack 1 stores the characters on the left of the head, with 
the bottom of stack storing the leftmost character of the tape in the TM. 
Stack 2 stores the characters on the right of the head, with the bottom of 
the stack storing the rightmost character on the tape in the TM. In other 
words, if a TM configuration is aqib, the corresponding 2-PDA is in state 
with stack 1 storing the string a and stack 2 storing the string b R , both from 
bottom to top. 
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For each transition S(qi,Ci) = (qj , Cj , L) in the TM, the corresponding PDA 
transition pops c« off stack 2, pushes Cj into stack 2, pops stack 1 and pushes 

4 

the character into stack 2, and goes from state to qj. For any transition 
£(&>£») = in the TM, the corresponding PDA transition pops c* off 

stack 2 and takes away c*, push Cj into stack 1, and goes from state to qj . 


ordinary TM by a write-twice TM. The write-twice TM 
simulates a single step of the original TM by copying the entire tape over to 
a fresh section of the tape to the right of the currently used section. The 
copying procedure operates character by character, marking a character 
it is copied. This procedure alters each tape square twice, once to write the 
character for the first time and another time to mark that it is copied. The 
position of the original TMs tape head is marked on the tape. When copy¬ 
ing the cells around the the marked position, the tape contents is updated 
according to the rules of the TM. 

To carry out the simulation with a write-once TM, operate as before, except 
that each cell of the tape is represented by two cells. The first of these 
contains the original TM’s tape contents and the second is for the mark used 
in the copying procedure. The input is not presented to the machine in the 
format with two cells per symbol, so the very first time the tape is copied, 
the copying marks are put directly over the input symbols. 


We first simulate 






:T; 


A Turing machine with doubly infinite tape can easily simulate an ordinary 
Turing Machine. It needs to mark the left-hand end of the input so that it 
can prevent the head from moving off of that end. 

To simulate the doubly infinite tape TM by an ordinary TM, we show how 
to simulate it with a 2-tape TM, which was already shown to be equivalent 
in power to an ordinary TM. The first tape of the 2-tape TM is written with 
the input string and the second tape is blank. We cut the tape of the doubly 
infinite tape TM into two parts, at the starting cell of the input string. The 
portion with the input string and all the blank spaces to its right appears on 
the first tape of the 2-tape TM. The portion to the left of the input string 
appears on the second tape, in reverse order. 


3.11 


We simulate an ordinary TM with a reset TM has only the RESET and R 
operations. When the ordinary TM moves its head right, the reset TM does 
the same. When the ordinary TM moves its head left, the reset TM cannot, 
so it gets the same effect by marking the current head location on the tape, 
then resetting and copying the entire tape one cell to the right, except for 
the mark, which is kept on the same tape cell. Then it resets again, and 
scans right until it find the mark. 


3.12 
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We simulate this TM variant by an NFA. The NFA only needs to keep track 
of the TM’s current state and what the TMs written on the current tape cell. 
Remembering what it has written on tape cells to the left of the current head 
position is unnecessary, because the TM is unable to return to these cells an 
read them. Using an NFA in the actual construction is convenient because it 
allows e moves which are useful for simulating the “stay put” TM transitions. 

More formally, the NFA has \Q x T| + 3 states. The first |Q x T\ states 

correspond to the state of the TM variant and the character on the tape 

% 

that the head of the TM points to. The extra states are the start state 

fttart; a special accept state Accept and a special reject state Reject- The 
transition function S' for the NFA is constructed according 5. First we set 

6* (cfetart> p) = {go P }j where go is the start state of the TM variant. Next, we 
set ^(gaccept* 0 == {g'accept} for any i. 

If 5(p,a) = (qAccepty b, w) } where w = R or S, we set 6'(q pa) e) = {g aC cept}- If 
6(p y a) = (g r eject) b 1 w) ) where w = R or S, we set £'(g po ,e) = {^reject}- 


.13 




a. For any two decidable languages L\ and L 2 , let M\ and M 2 be the TMs that 

decide them. We construct a TM M 1 that decides the union of L\ and L 2 : 


3.14 


“On input w: 


Run Mi on w, if it accepts, accept. 

Rim M 2 on u>, if it accepts, accept. Otherwise, reject. 

M ' accepts w if either M\ or M 2 accepts it. If both reject, M' rejects. 

b. For any two decidable languages L\ and L 2 , let Mi and M 2 be the TMs that 

decide them. We construct a NTM M' that decides the concatenation of L\ 

and Z /2 • 


1 . 




2 . 


“On input w : 

1. For each way to cut w into two parts w = W 1 W 2 '. 

Run Mi on w\. 

Run M 2 on W 2 - 

If both accept, accept. Otherwise, continue with the next 

u>i, U> 2 - 

5. All cuts have been tried without success, so reject. 

We try every possible cut of w. If we ever come across a cut such that the 
first part is accepted by Mi and the second part is accepted by M 2 , w is in 
the concatenation of L\ and L 2 . So M' accept w. Otherwise, w does not 
belong to the concatenation of the languages and is rejected. 

c. For any decidable language L, let M be the TM that decides it. We construct 

a NTM M' that decides the star of L: 

“On input w: 

1. For each way to cut w into parts so that w 

Run M on for t = 1,2 

string , accept. 

3. All cuts have been tried without success, so reject. 


2 . 




4. 




W 1 W 2 . ..w n : 
n. If M accepts each of these 




• • • 
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If there is a way to cut w into different substrings such that every substring is 
accepted by M, w belongs to the star of L and thus M' accepts w . Otherwise, 
w is rejected. Since there are finitely many possible cuts of w , M' will halt 
after finitely many steps. 

d. For any decidable language L, let M be the TM that decides it. We construct 

a TM M' that decides the complement of L: 

“On input w: 

1. Run M on w. If M accepts, rejects if M rejects, accept . 

Since M' does the opposite of whatever M does, it decides the complement 
of L. 

e. For any two decidable languages L\ and Li, let Mi and M 2 be the TMs that 

decide them. We construct a TM M' that decides the intersection of L\ and 




1/2 • 


“On input w: 

1. Run Mi on w ) if it rejects, reject . 

2. Run M 2 on w , if it accepts, accept . Otherwise, reject . 

M' accepts w if both Mi and M 2 accept it. If either of them rejects, M' 
rejects w , too. 




a. For any two Turing-recognizable languages L\ and L 2 , l et Mi and M 2 be the 

TMs that recognize them. We construct a TM M' that recognizes the union 

of L\ and L 2 : 

“On input w: 

1. Run Mi and M 2 alternatively on w step by step. If either 

accept, accept . If both halt and reject, then reject . 

If any of Mi and M 2 accept w, M' will accept w since the accepting TM will 
come to its accepting state after a finite number of steps. Note that if both 
Mi and M 2 reject and either of them does so by looping, then M will loop. 

b. For any two Turing-recognizable languages L\ and L 2 , let Mi and M 2 be 

the TMs that recognize them. We construct a NTM M ; that recognizes the 
concatenation of L\ and L 2 ' 

“On input w: 

1. Nondeterministically cut w into two parts w 

2. Run Mi on w 1 . If it halts and rejects, reject . If it accepts, go 

to stage 3. 

3. Run M 2 on W 2 - If it accepts, accept . If it halts and rejects, 

reject . 

If there is a way to cut w into two substrings such Mi accepts the first part 
and M 2 accepts the second part, w belongs to the concatenation of L\ and 
L 2 and M' will accept w after a finite number of steps. 

c. For any Turing-recognizable language L, let M be the TM that recognizes it. 

We construct a NTM M' that recognizes the star of L : 

“On input w: 


3.15 




W\W2- 
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1. Nondeterministically cut w into parts so that w 

2. Run M on for all i. If M accepts all of them, accept . If it 

halts and rejects any of them, reject . 

If there is a way to cut w into substrings such M accepts the all the substrings, 
w belongs to the star of L and M 1 will accept w after a finite number of steps. 

d. For any two Turing-recognizable languages L\ and L 2 , let Mi and M 2 be 

the TMs that recognize them. We construct a TM M ' that recognizes the 
intersection of L\ and L 2 : 

“On input w: 

1. Rim Mi on w. If it halts and rejects, reject . If it accepts, go 

to stage 3. 

2. Run M 2 on w. If it halts and rejects, reject . If it accepts, 

accept . 

If both of Mi and M 2 accept tu, w belongs to the intersection of L\ and L 2 
and M' will accept w after a finite number of steps. 


W\W2 * * * W n . 






If A is decidable, the enumerator operates by generating the strings in lexico¬ 
graphic order and testing each in turn for membership in A using the decider. 
Those strings which are found to be in A are printed. 

If A is enumerable in lexicographic order, we consider two cases. If A is finite, 
it is decidable because all finite languages are decidable. If A is infinite, a de¬ 
cider for A operates as follows. On receiving input tu, the decider enumerates 
all strings in A in order until some string lexicographically after w appears. 
That must occur eventually because A is infinite. If w has appeared in the 
enumeration already then accept , but if it hasn’t appeared yet, it never will, 
so reject . 

Note: Breaking into two cases is necessary to handle the possibility that 
the enumerator may loop without producing additional output when it is 
enumerating a finite language. As a result, we end up showing that the 
language is decidable but we do not (and cannot) algorithmically construct 
the decider for the language from the enumerator for the language. This 
subtle point is the reason for the star on the problem. 


.16 


3.17 


We give a DFA A that is equivalent to TM M. Design A in two stages. 
First, we arrange the states and transitions of A to simulate M on the read¬ 
only input, portion of the tape. Second, we assign the accept states of A to 
correspond to the action of M on the read/write portion of the tape to the 
right of the input. In the first stage, we construct A to store a function 

(Q U{acc,rej}) 

in its finite control where Q is the set of M’s states and s is the string that 
it has read so far. Note that only finitely many such functions exist, so we 
assign one state for each possibility. 


F a : (Q U {first}) 




The function F, contains information about the way M would compute on a 
string s. In particular, F, (first) is the state that M enters when it is about 
to move off of the right end of a for the first time, when M is started in its 
starting state at the left end of a. If M accepts or rejects (either explicitly or 
by looping) before ever moving off of the right end of s, then F s (first) = acc 
rej accordingly. Furthermore, for F a (q) is the state M enters when it is 
about to move off of the right end of a for the first time, when M is started 
in state q at the right end of s. As before, if M accepts or rejects before ever 
moving off of the right end of s, then F 9 (q) = acc or raj. 

Observe that, for any string a and symbol a in S, we can determine F sa 
from a, F $ , and Af’s transition function. Furthermore, Ft is predetermined 
because F$ (first) = q$ % the start state of Af, and Fg(q) = q (because TMs 
that attempt to move off the leftmost end of the tape just stay in the left- 
hand cell). Hence we can obtain A’s start state and transition function. That 
completes the first stage of A’s design. 

To complete the second stage, we assign the accept states of M. For any 

two strings a and t, if F, = Ft, then M must have the 
t —either both a and t are in L(M) or both are out. Hence A’s state at the 
end of a is enough to determine whether AT accepts s. More precisely, we 
declare a state of A to be accepting if it was assigned to a function F that 
equals F, for any string a that M accepts. 




e output on a and 


t il 


The language A is one of the two languages, {0} or {1}. In either case the 

e not able to determine which 

n’t be able to describe the decider for A 






language is finite, and hence decidable. If we 

of these two languages is A, we 
but we can give two TM s, one of which is A’s decider. 
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4.1 
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ICJ 






> 


ICI 




. No 


. Ye 


ii 


{{A, R)\ A is a DFA ,R is a regular expression and L(A) 


Let £Qdfa,rex 
L(R)}. The following TM E decides ^Qdfa.rex- 




E = “On input (A,R): 

S - ’ . 

1. Convert regular expression A to an equivalent DFA B using 

the procedure given in Theorem 1.28. 

2. Use the TM C for deciding EQ 0FA given in Theorem 4.5, on 

input ( A,B ). 
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3. If R accepts, accept. If R rejects, reject. 




4.3 


Let ALLqfa = {( A)\ A is a DFA that recognizes 
decides ALLofa- 

“On input (A) where A is a DFA: 



♦ 



The following TM 




1 . 




Construct DFA B that recognizes L(A) as described in Exer¬ 
cise 1.10. 

R un TM T from Theorem 4.4 on input (£), where T decides 

^DFA* 

f 

If T accepts, accept. If T rejects, reject .” 


4.4 


Let -Aecfg = {(G) | G is a CFG that generates 
decides AeqfQ' 

“On input ( G) where G is a CFG: 



The following TM V 



1 . 


Run TM S from Theorem 4.6 on input (G, e), where 5 is a 
decider for Acfg- 


2. If S accepts, accept. If S rejects, reject. 


99 


4.5 





Let INFINITE of a = {(A) | L(A) is an infinite language}. The following TM 
I decides INFINITE dfa* 

“On input (A) where A is a DFA: 

1. Let k be the number of states of A. 

Construct a DFA D that accepts strings of length > k. 

Construct a DFA M such that L(M) = L(A) fl L(D). 

Run TM T from Theorem 4.4 on input (Af), where T decides 

-Bdfa- 

If T accepts, reject. If T rejects, accept 

If A accepts a string with length at least fc, this string can be must cause 

A to enter the same state at least twice, so it can be pumped up to obtain 

• • #' • • • 

... % 

many strings that are accepted by A. Conversely, if A accepts 
infinitely many strings, it must accept strings of arbitrarily long lengths, in 
particular, a string of length at least k . 





4.6 a. g is one-to-one. / is not one-to-one because /(l) = /(3). 

b. g is onto. / is not onto because there does not exist x € X such that 

f(x) = 10. 

c. g is a correspondence because g is one-to-one and onto. / is not a correspon¬ 
dence because / is not one-to-one and onto. 
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Suppose B is countable and a correspondence /: Af — >B exists. We construct 

x in B that is not paired with anything in Af. Let x = £ 1 X 2 -Let = 0 if 

f(i)i = 1 , and X{ = 1 if f(i)i = 0 where f(i)i is the ith bit of f(i). Therefore, 
we ensure that x is not f(i) for any i because it differs from f(i) in the ith 
symbol, and a contradiction occurs. 


4-7 


2 , 3 J 5* 5 . Function 


AT. Let 


We demonstrate a one-to-one /: T 
f is one-to-one because if a ^ 6, f(a) ^ /(&). Therefore, T is countable. 




Let ~ be the binary relation “is the same size”. In other words If A ~ B, A 
and B are the same size. We show that ~ is an equivalence relation. First, ~ 
is reflexive because the identity function f(x) = x, Vx € A is a correspondence 
/: A — >A. Second, ~ is symmetric because any correspondence has an 
inverse, which itself is a correspondence. Third, ~ is transitive because if 
A ~ B via correspondence /, and B ~ C via correspondence g , then A 
via correspondence fog (the composition of / and g). Because ~ is reflexive 
symmetric, and transitive, ~ is an equivalence relation. 


4.9 


C 


The following TM X decides 4. 

: u On input ( M) where M is a DFA: 

1. Construct a DFA O that accepts any string containing an odd 

number of Is. 

2. Construct DFA B such that L(B) = L(M) fl L(0). 

3. Run TM T from Theorem 4.4 on input (JB), where T decides 

#DFA- 

4. If T accepts, accept . If T rejects, reject . 


4.10 


X 




We observe that L(R) C L(S) if and only if L(S) fl L(R) = 0. The following 
TM X decides A. 

: “On input (R, S) where R and S are regular expressions: 

1. Construct DFA E such that L(E) = L(S) fl L(R ). 

2. Run TM T from Theorem 4.4 on input ( E ), where T decides 

#DFA- 

% 

3. If T accepts, accept . If T rejects, reject . 


4.11 


X 




We have shown in Problem 2.17 that if C is a context-free language and R is 
a regular language, then CDR is context free. Therefore 1* fl L(G) is context 

free. The following TM X decides A. 

; “On input (G) where G is a CFG: 

1. Construct CFG H such that L(H) = 1* fl L(G). 

2. Run TM R from Theorem 4.7 on input ( H ), where R decides 

Ecfg* 


4.12 


X 


Instructor’s Manual 


35 


. If R accepts, reject . If T rejects, accept . 




* 


The following TM X decides A . 

: “On input (R) where R is a regular expression: 

1. Construct DFA E that accepts 

2. Construct DFA B such that L(B) = L(R) H L(E). 

3. Run TM T from Theorem 4.4 on input ( B ), where T decides 

Ed fa- 

4. If T accepts, reject . If T rejects, accept . 


4.14 


X 






for it; G {0,1}*}. A is a CFL. Problem 


Let A = {v| v is of the form wit; 

2.17 showed that B fl A is context free if B is regular. The following TM X 


4.15 


decides E . 


“On input ( M) where M is a DFA: 

1. Let C = L(ilf) fl A. Let G be the CFG of C. 

2. Rim TM R from Theorem 4.7 on input (G), where R decides 

•Ecfg* 

3. If R accepts, reject . If /£ rejects, accept . 

X decides £7 because a DFA M accepts some string of the form ww n if and 
only if L(M) fl A ^ 0. 


X 




For any DFAs A and B, L(A) = L(B) if and only if -4 and B accept the 
same strings up to length mn, where m and n are the numbers of states of 
A and B, respectively. Obviously, if L(A) = L(B)> A and B will accept the 
same strings. If L(a ) ^ L(B), we need to show that the languages differ on 
some string s of length at most mn. Let t be a shortest string on which the 
languages differ. Let l be the length of t. If / < mn we are done. If not, 
consider the sequence of states go > <Zi > • • • > Qi that A enters on input f, and 
the sequence of states ro,ri,... ,rj that B enters on input t. Because A has 
m states and B has n states, only mn distinct pairs (g,r) exist where q is 
a state of A and r is a state of B . By the pigeon hole principle, two pairs 
of states ( qi,r{ ) and ( qj ,rj) must be identical. If we remove the portion of 
t from i to j — 1 we obtain a shorter string on which A and B behave as 
they would on t. Hence we have found a shorter string on which the two 
languages differ, even though t was supposed to be shortest. Hence t must 
be no longer than mn. 


4.16 


We need to prove both directions. To handle the easier one first, assume that 

* 

D exists. A TM recognizing C operates on input x by going through each 

possible string y and testing whether (x, y) G D. If such a y is ever found, 

* 

accept ; if not, just continue searching. 


4.17 


r 


i 


si? 




ml 


I 8 1 


4 


* > 


that C is recognized by TM Af. Define a 
language £ to be {(x,y)| Af accepts x within |y| steps}. Language B is 

decidable, and if x E. C then Af accepts x within some number of steps, so 

« * * — 

(x,y) € B for any sufficiently long y, but if x £ C then (x,y) £ C for any y. 


For the other direction 






lllllT 




% and A and B are 


Let A and B be two languages such that A fl B 

■ • •• 

recognizable. Let J be the TM recognizing ~A and K be the TM recognizing 
2?. We will show that the language decided by TM T separates A and B. 


4.18 


“On input w: 

. _ * • * _ . 

• • . • * _ ■ _• 

1. Simulate J and K on w by alternating the steps of the two 

machines. 

4 

4 * • 

If J accepts first, reject . If K accepts first, accept. 

The algorithm T terminates because A U B 
accept w eventually. ACC because if w € A, w will not be recognized by 

J and will be accepted by if first. B C 72 because if w G £, w will not be 

• ••*’..*: . ■ 

• •• • 

recognized by K and will be accepted by J first. Therefore, C separates A 
and B . 


T 






E*. So either J or K will 


For any language A, let A n = {w n \ w 

L(M) 71 . The following TM T decides 5. 


€ A}. If (Af) e 5, then L(M) 


4.19 


“On input (Af), where Af is a DFA: 

1. Construct DFA N that recognizes L(M) n . 


T 


5 on (Af, 2V), where F is the Turing 


2. Rim TM F from Theorem 4 


machine deciding EQqfa 

\ *• m *v • , 

. If F accepts, accept . If F rejects, reject. 




K 


9 




{{P)\ P is a PDA that has useless states}. The following TM T 


Let U 
decides U. 

“On input (P), where U is a PDA: 

* , . . / • 

»»••••■• • % • 

1. For each state q of P: 

• • • . • . * 

Modify P so that q is the only accept state. 

• . ^ .*•%••• . . . 1 • 

- . - . • ' ’ • ‘ 4 * % • • . * •'.;*••• • ‘ < 

Use the decider for Ep da to test whether the modified PDA 

• > • •%.■* •* • *■ • . . .* .- • • . • 

language is empty. If it is, accept. If it is not, continue. 

• * * % 

4. At this point, all states have been shown to be useful, so reject. 


4.20 


T 


2 . 




s 


4 




Let (Afi), (Af 2 ),... be the 

} be a list of all strings 


Use the diagonalization method to obtain D. 
output of an enumerator for A. Let E = {$i, 82 
over the alphabet. The algorithm for D is: 

“On input w: 

1. Let i be the index of w on the list of all strings, that is, Si 

* 

€ 

2. Run (M{) on input w. 


4.21 


I • * * 


D 


w. 
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. If it accepts, reject. If it rejects, accept. 

h 

D is a decider because each Mi is a decider. But D doesn’t appear on the 
enumeration for A , because it differs from on input 




Is 


Let K be the enumerator for B. The following TM L is the lexicographic 
order enumerator for C. 


4.22 


L = “Ignore the input. 

1. Enumerate machines Mi by K. 

2. For each Mi, insert useless states, symbols or transitions to ob¬ 
tain Ni such that L(Mi) = L(N,) and (Ni) is lexicographically 
larger than all (Nj) for 1 < j <i. Output Ni. 




Let (N x ), (N 2 ) 

for all i, every machine described in B has an equivalent one in C and vice 
versa. C is decidable because it enumerated by L in lexicographic order, by 
virtue of Problem 3.16. 


be the strings enumerated by L. Since L(Ni) = L(Mi) 


• » t 


Chapter 5 


Suppose for a contradiction that JBQcfg were decidable. We construct a 
decider M for ALL & G = {(G )| G is a CFG and L(G) = £*} as follows: 

M = “On input ( G ): 

1. Construct a CFG H such that L(H) = £*. 

2. Run the decider for EQcfg on ( G,H ). 

• If it accepts, accept . If it rejects, reject. 

* 

M decides ALLqfq assuming a decider for EQqfg exists. Since we know 
ALLcfg Is undecidable, we have a contradiction. 


.1 






S 




Here is a Taring Machine M which recognizes the complement of EQ CF g : 

. ; •• • . , •»* 

■ * , • » « * . 

• “On input (G,H)i 

1. Lexicographically generate the strings x G £*. 

2. For each such string x: 

Test whether x E L(G) and whether x E L(H), using the 
algorithm for j4cfg* 

If one of the tests accepts and the other rejects, accept ; oth¬ 
erwise, continue.” 




M 




4. 


f ab -j f ab ] [aba*] fb] fb*| faa] paa] 

LababJ LababJ L"b~j LaJ LaJ LTJ LTJ* 


Here is a match: 







y p 


/I 


93 I 


► • 


example, {a n b n c n | n > 0} < 


No, it does not imply that A is regular. For 

* ’ « * * *4" 

{a n b n | n > 0}. The reduction first tests whether its input is a member of 
{a n b n c n | n > 0}. If so, it outputs the string 
string 


5.4 


m 




ab, and if hot, it 


<: 


*•» 


■A 


t 


£ 


* \ 


5. 




I 






via the 


from the definition of mapping reducibility that 
reduction function /. Observe that Ej m is 
not Turing-recognizable, contradicting Theorem 5 




if :1 m 


1 


TM I 


m 


/ 


lie 






22 






•* % 


'S 


e computable functions / and 

• * • V . 

y(y) € C. Consider the 
We can build a TM which computes 

a TM exists since we assumed 

Then simulate a 


Suppose A < m B and B 


M 

J. 


5. 


:1| 


V • 




* » 


such that a: € A 


€ B and y 


€ B 




function composition 

follows: first simulate a TM for / 


h 


:k 


and call the output y 


that / is 

TM for g on y. The output is h(x) = g(f(x)). Therefore, h is a computable 
function. Moreover, x E A 










h(x) E C. Therefore, -A < 


C via the 




reduction function h. 


• • 


W. 


- 4 




Suppose A < m Then, A < 
Because A is 
recognizable, and then 


A, by the definition of mapping reducibility. 
Turing-recognizable, Theorem 5.22 implies that A is Turing 


5.7 


m 


Theorem 4.16 implies that A is decidable. 


• * 


• £■< 


We need to handle the case where the head is at the leftmost tape call and 

• *• • ^ • 

attempts to move left. To do so we add dominos 








r #qg ] 

L#rfeJ 


For every q,r E Q and a,b € I\ where S(q,a ) = (r, b, L). 




*> 


Suppose L is a Turing-recognizable language and let M be a TM that rec 
ognizes it. To reduce L to Atm we map any string x to (Af,x). Then 
x E L (M, x) E Ajm* In addition, this mapping is computable, so it 
gives a mapping reduction from L to Atm • 




To prove that J is not Turing-recognizable, we show that Atm < m J- The 
reduction function maps any string y to the string ly. Then y E Atm 
ly E J. This mapping is computable, so we have shown that Atm < 

To show that J is no t Turing-recognizable we show that Atm <m J (since 

J 4=> Atm <m J )• The reduction function maps any string x to 

• * •* - 

computable, so we have 


5.10 


J. 




Atm < 

Ox. Then x E Atm <==> Ox E J. This mapping is 
shown that Atm <m J • 
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Use the language J from Problem 5.10. That problem showed that J is 
undecidable. Here we show that J < m J. The reduction / maps e to itself 
and for other strings x, 


.11 






0 8 if x 
1 8 if x 


Is for some string s G £ 
Os for some string s G E 


/(*) 


/(x) G J. Thus, J < m J. 


The function / is computable, and x G J 


We show that >1 tm <m S by mapping (M,w) to (M') where M' is the 
following TM: 

= “On input x: 

✓ 

1. If x = 01 then accept . 

2. If x ^ 10 then reject. 

. If x = 10 simulate M on w. If M accepts w then accept ; if M 
halts and rejects w then reject . 

If (M,w) G j4tm then M accepts w and L(M') = {01,10}, so (M') G 5. 
Conversely, if (M,w) £ Ajm then L(Af') = {01}, so (M r ) £ S. Therefore, 

(M, w) G i4 T M 


5.12 




M 






(AT) G 5. 


Let U523L2555 tm = {(Af)| M is a TM with 1 or more useless states}. We 
show USELESSjm is undecidable by reducing Ejm to it. Let R be a TM that 

decides USELESSjm and construct a TM S that decides Ejm as follows. 

a 

S = “On input 

1. First we construct a TM N that simulates M on w but ensuring 

that all of M’s original states are not useless. The TM N has 
input alphabet {0,1,2}. On input 0, N goes through each of 

the original states of M, except for Accept and Reject > until all 
nonhalting states of M have been used, and then enters state 

^accept- On input 1, N enters state Reject• On input 2, N 

simulates M on w. Finally N restores the original symbol to 

% 

where the new symbol had been written, and branches to M’s 
start state to simulate M. If that simulation is about to enter 
M’s accept state, it instead enters a new state r. 

2. Run USELESSjm on input ( N ). If it accepts, reject. If it 

rejects, accept. 

The only possible useless state of N is the state r, because on inputs 0 and 

a 

1 it uses all other states. The only way for N to use state r, is if M accepts 
w. So, testing whether N has any useless states, in effect tests whether M 

M •• • 

accepts w. Note that the hardest part of this problem is making sure that 
the machine N has no “unexpected” useless states. 


.13 
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{(M,u/)| M on w tries moving its head left when on the leftmost 

e show that if it 


Ljm 

tape cell}. This language is undecidable. To prove it, 

decidable by some machine say R then we could decide -Atm with the 
following machine S: 


.14 


kVi 


SEE? 


44 0n input (M>w): 

1. Convert M to M', where M' first moves its input over one 

square to the right and puts a special delimiter on the leftmost 
tape cell. Then M' simulates M on the input. If M' ever 
delimiter (M must have tried to move left when on the leftmost 
tape cell), M' automatically moves right and stays in the same 
state. If M accepts, it moves its head all the way to the left 
and then moves left off the leftmost tape cell. 

2. Run R, the decider for Ljm> on (M',tu). 

3. If R accepts then accept. If it rejects, reject. 

TM 5 decides -Atm since the only way M' moves left from the leftmost tape 
cell is when M accepts w. S never attempts the move during the course 
of the simulation because we put the special delimiter to “intercept” such 

moves if made by M. 


S 


es a 


i 




Let LMjm = {(M,w)\ M ever moves left while computing on w}. LMjm 
is decidable. Let Mleft be the TM which on input (M, w) determines the 
number of states tim = \Qm\ of M and then simulates M on w for \w\+n\f+l 
steps. If Mleft discovers that M moves left during that simulation, Mleft 
accepts (M, w). Otherwise Mleft rejects (M,u;). 

The reason that Mleft can reject without simulating M further is as follows. 
Suppose M does make a left move on input w. Let p = go> Qu • • • > <7« be the 
shortest computation path of M on w ending in a left move. Because M 
has been scanning only blanks (it’s been moving right) since state q| w |, we 
may remove any cycles that appear after this state and be left with a legal 
computation path of the machine ending in a left move. Hence p has no 

cycles and must have length at most |u>| +um +1. Hence Mleft would have 
accepted (M,tc;), as desired. 


.15 


{(M) | M is a two tape TM which writes a non-blank symbol on 


Let B 

its second tape when it is simulated on some particular input w}. We show 
that Atm B by mapping (M, w) to (M') where M' has the following 
description: 


5.16 


44 0n input x: 

1. Simulate M on w only using the first tape. 

2. If M accepts w then write blah blah ! on the second tape. 

Observe that (M, w) € -Atm 


M 




M' e B. 
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The PCP over a unary alphabet is decidable. We describe a TM M that 
decides unary PCP . Given a unary PCP instance 


5.17 


us.a 


M = “On input (ai, &i,... ., a n , b n ): 

1. Check if a* = 6» for some i . If so, accept . 

2. Check if there exist i,j such that a* > 6* and ay < bj. If so, 

accept . Otherwise, reject. 

In the first stage, M first checks for a single domino which forms a solution. 
In the second stage, M looks for two dominos which form a solution. If it 
finds such a pair, it can construct a solution by picking (bj — aj) pieces of the 
i th domino, putting them together with (a* — 6*) pieces of the j th domino. 
This construction has ai(bj - aj ) + aj (a* — &,) = a»6y — ajbi Is on the top, 
and bi(bj — ay) 4* 6y(a< - 6*) = a<6y — ay 6* Is on the bottom. If neither stages 
of M accept, the problem instance contains dominos with all the upper parts 

j 

m 

having more (or less) Is than the lower parts. In such a case no solution can 
exist. Therefore M rejects. 




We construct a computable function that takes a PCP instance of any finite 

/ 

alphabet and produces a binary alphabet PCP (BPCP ) instance. As a result 
we show that BPCP is undecidable. 


5.18 


}, the function 


For any PCP instance A over a finite alphabet {ai 
encodes each character ai into binary as 10*. The function is a mapping 

• • 0 

# • 

reduction from the general PCP problem to the BPCP problem. For any P 

in PCP r the converted instance P' is in BPCP because a match for P becomes 

a match for P' using the same set of dominos after conversion. Conversely, if 

• • | • • • 

• • . * • * • 

P' is in BPCP, P is in PCP, because we can uniquely decode a match in P 
to a match in P. Therefore, the function is a mapping reduction from PCP 


• j fl n 


• • 






to BPCP 


We reduce PCP to AMBIGcfq , thereby proving that AMBIGqfq is unde¬ 
cidable. We use the construction given in the hint in the text. 

• • • . 

Note: The hint that appears in the first printing is incorrect. The correct 
version (which appears in the second and subsequent printings) is: Given an 
instance 


5.19 


M.[£]} 


p 


s 


T | B 

tiTai 
61 B&i 


I tkT*k | *i»i | ••• I 

j bkB&k j 61 ai 

where ai, ... , a* are new terminal symbols. We prove that this reduction 
works. 


T 


• • • 






• • • 


• • • 
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We show that P has a solution iff the CFG is ambiguous. If P has a match 
UiU 2 -"ti t = bi x bi 2 ... b it , the string 



has two different leftmost derivations, one from T and one from B. Hence 
the CFG is ambiguous. 

Conversely, if the CFG is ambiguous, some string w has multiple leftmost 
derivations. All generated strings have the form w = w 8ta rt*ii - - - where 
w s t a rt contains only symbols from P’s alphabet. Notice that once we choose 


the first step in the derivation of w (either S -> T or 5 -* B), the following 
steps in the derivation are uniquely determined by the sequence a*,... 


a» 2 at 


Therefore w has at most two leftmost derivations: 



and 

ii) S' P —► —> - - - —> bi x bi 2 ... £>»,a*,... a^a^. 

If w is ambiguous, ti x ti 2 ... t^a*,... a^a^ = b^ b{ 2 ...&», a», ... a^ 3 a» x , and 
therefore U X U 2 ... U x = bi x b { 2 .. .6< r Thus, the initial PCP instance, P, has 

a match. 


5.20 a. A 2DFA M with s states computing on an input x of size n has at most 

s(n+2) 2 possible configurations. Thus, on input x, M either halts in s(n-f 2) 2 
steps or loops forever. To decide A. 2 dfa> ft Is enough to simulate M on x for 
s(n 4- 2) 2 steps and accept if M has halted and accepted x during this finite 

duration. 

b. Suppose a TM D decides P 2 DFA- We construct a TM E that decides Ptm« E 

runs as follows: 

E = “On input (M), 

1. Construct a 2D FA M’ that recognizes the language of accepting 

computation histories on M, namely, {(ci#C 2 #.. .#c*)| c\ is a 
configuration of M, c\ is a start configuration, c* is an accept¬ 
ing configuration, c »+1 legally follows c\ for each i}. A 2DFA 
can check whether c\ is a start configuration of M by verifying 
that it begins with a start state qo and contains legal sym- 

a 

bols from M’s input alphabet. Similarly, it can check whether 
c*. is an accepting configuration by verifying that it contains 

^accept and symbols from M’s tape alphabet. These two steps 
use only one head. To check whether c *+1 legally follows c», 
the 2DFA can keep its two heads on c* and Ci+ 1 , and compare 
them symbol by symbol. Notice that L(M') = 0 iff L(M) = 0. 

2. Rim D on (M'). If D accepts, accept . If D rejects, reject” 

Since Pjm is undecidable, P 2 DFA is decidable. 
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We formulate the problem of testing 2DIM-DFA equivalence as a language: 
•EQ 2 DIM-DFA = {(Mi, Af 2 )| Mi is equivalent to M 2 }. 

We show that if PQ 2 DIM-DFA is decidable, we can decide -Atm by reducing 

-Ajm to 2?Q2DrM-DFA‘ Therefore PQ 2 DIM-DFA is undecidable. Assume M is 
the TM deciding PQ 2 DiM-DFA‘ We construct S that decides A 


follows: 


0 




s 


“On input (A, it;), 

1. Construct a 2DIM-DFA T that accepts only the accepting com¬ 
putation history of A on it/. The input that T accepts should 
represent the computation history of A in the form of an m x 
rectangle, where (m - 2) is the length of the longest configura¬ 
tion that A ever has during the computation and (n — 2) is the 
number of steps it takes to get to the accepting state. Each row 
of the inner cells contains a proper configuration of A during 
its computation. In order to check if the input is a computa¬ 
tion history of A on u/, T follows a procedure similar to that 
given the proof of Theorem 5.9. It first checks that the first 
inner row contains the configuration with input w and the start 
state at the left end. Then it checks whether the TM reaches 




the accept state at the last inner row. After that, it compares 
two consecutive rows of input to see if the two configurations 
follow the transition function of A. If the input passes all the 
three tests, the 2DIM-DFA accepts. Otherwise, it rejects. 

2. Construct another 2DIM-DFA E which always rejects. Run M 

e equivalent. If M accepts, reject 


on (T, E) to see if T and E 
because there is no computation history for A on it/. Otherwise, 

accept . 


Si 




The T M S decides Atm • However 
fore PQ 2 DIM-DFA is also undecidable. 


e know that Atm is undecidable. There- 


kVi 


Suppose P is a decidable language satisfying the properties and let T P be 
a TM that decides P . Without loss of generality assume (T 0 ) £ P with 

L ( T 0 ) 

there exists a TM Mi with (Mi) € P. We can decide Atm by constructing 
the following TM: 

“On input (M,ti/): 

1. Construct a TM M w that accepts on input x iff M accepts w 

and Mi accepts x . 

2. Run Tp on (M w ). If it accepts, reject. Otherwise, accept.” 

If w € L(M ), L(M W ) = L(Mi) and M w should be accepted by T P since 

(Mi) e P. If ti/ g -L(M), L(M W ) 

the assumption that (T 0 ) £ P. Therefore we have w e L(M) iff (M x ) e P. 
However, since Atm is undecidable, P is not decidable either. 


5.22 




since we can consider P if (T$) is in P. Since P is not trivial, 


X(Tg) and should be rejected by 
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First, let P be the language {{M)\ M is a TM with 5 states}. P is non¬ 
trivial, and so it satisfies the second condition of Rice’s Theorem but P 

be easily decided by checking the number of states of the input Turing 
Machine. Second, let P be the empty set. Then it does not contain any TM 

it satisfies the first condition of Rice’s Theorem, but and P can be 


5.23 


can 


and 

decided by a TM that always rejects. 

Therefore both properties are necessary for proving P undecidable. 




Chapter 6 


The following program is in LISP: 

((LAMBDA (X) (LIST X (LIST (QUOTE QUOTE) X))) 

(QUOTE (LAMBDA (X) (LIST X (LIST (QUOTE QUOTE) X))))) 


6.1 


We build up two incomparable sets A and B to kill off all possible Turing- 
reductions, Hi, Ha,... . The sets start off undefined. 

First, extend A to be longer and longer initial portions of the empty set 
and B to be longer and longer initial portions of ATM. Eventually, Hi must 

get the wrong 

B interchanging the empty set and ATM until Hi errs in reducing A to B. 
Continue with Ha, etc. 






when trying to reduce B to A. Then extend A and 




El iT:iVi 


Chapter 7 


a. True 

b. False 

c. False 

d. True 

e. True 

f. True 


7.1 


a. False 

b. True 

c. True 

d. True 

’ r 

e. False 

f. False 


7.2 


We can use the Euclidean algorithm to find the greatest common divisor. 


7.3 
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1274 x 8 + 313 
313 x 4 + 22 
22 x 14 + 5 
5x4 + 2 

• 4 

2x2 + 1 
1x2 + 0 


1274 


313 


22 






The greatest common divisor of 10505 and 1274 is 1. Therefore they are 

relatively prime. 


b. 


7289 x 1 + 740 
740 x 9 + 629 

629 x 1 + 111 
111 x 5 + 74 
74 x 1 + 37 
37 x 2 + 0 


8029 


7289 


740 


629 


111 


74 


The greatest common divisor of 8029 and 7289 is 37. Therefore they 

relatively prime. 


e not 


7.4 


The table constructed using the algorithm from Theorem 7.14 is 


follows: 


[:v> 


l 








T 


1 


T,R 


S 


S,R,T 




R 


S 


s 




T 


T,R 




R 


Because table( 1,4) contains S , the TM accepts w. 


The formula is not satisfiable. For any assignment of the boolean values for 

and y, it always makes one of the four clauses false. Therefore, the formula, 
which i 

of x and y. 


7.5 




a conjunction of the four clauses, is always false for any assignment 


7. 


P is closed under union. For any two P-languages Lj and L 2 , let Mi and M 2 
be the TMs that decide them in polynomial time. We construct a TM M' 
that decides the union of L\ and £>2 in polynomial time: 

= “On input ( w ): 

1. Run Afi on w. If it accepts, accept. 

2. Run M 2 on w. If it accepts, accept. Otherwise, reject. 


r* j 




M 
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M r accepts w if and only if either Mi and M2 accept w. Therefore, M 
decides the union of L\ and L 2 . Since both stages take polynomial time, the 

algorithm runs in polynomial time. 

P is closed under concatenation. For any two P-languages L\ and X 2 , let M\ 
and M 2 be the TM sthat decide them in polynomial time. We construct a 
TM M' that decides the concatenation of L\ and L 2 in polynomial time: 

= “On input (w): 

1. For each way to cut w into two substrings w 

Run Mi on wi and M 2 on iu 2 . If both accept, accept. * 

3. If w is not accepted after trying all the possible cuts, reject. 

M' accepts w if and only if w can be written as W 1 W 2 such that M\ accepts 
w\ and M2 accepts u; 2 . Therefore, M' decides the concatenation of L\ and 
L 2 . Since stage 2 runs in polynomial time and is repeated at most 0 (n) 

times, the algorithm runs in polynomial time. 

P is closed under complement. For any P-language L, let M be the TM 
that decides it in polynomial time. We construct a TM M' that decides the 

complement of L in polynomial time: 

= “On input (w): 

1. Rim M on w. 

2. If M accepts, reject. If it rejects, accept. 

M 1 decides the complement of L. Since M runs in polynomial time, M f also 
runs in polynomial time. 


M 


W1W2' 


2 . 




M f 




NP is closed under union. For any two NP-languages L\ and L 2 , let M\ and 
Af 2 be the NTM sthat decide them in polynomial time. We construct a NTM 
Af' that decides the union of L x and L 2 in polynomial time: 


7.7 


= “On input (w): 

1. Run Mi on w. If it accepts, accept. 

2. Run M 2 on w. If it accepts, accept. Otherwise, reject. 


M' 




In both stages 1 and 2, M' uses its nondeterminism when the machines being 

make nondeterministic steps. M' accepts w if and only if either M x and 

Since both 


run 

M 2 accept w. Therefore, M r decides the union of L x and I> 2 . 
stages take polynomial time, the algorithm runs in polynomial time. 

NP is closed under concatenation. For any two NP-languages L x and L 2 , let 
Mi and M 2 be the NTM sthat decide them in polynomial time. We construct 
NTM M' that decides the concatenation of L x and L 2 in polynomial time: 




= “On input (w): 

1. For each way to cut w into two substrings w 

Run Mi on w x . 

Run Af 2 on w 2 . If both accept, accept ; otherwise continue 
with the next choice of w x and tu 2 . 

4. If w is not accepted after trying all the possible cuts, reject. 


M 


W1W21 


2 . 


3. 
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In both stages 2 and 3, M' 

being rim make nondeterministic steps. M 1 accepts w if and only if w can be 

expressed as W 1 W 2 such that M\ accepts w\ and M 2 accepts W 2 - Therefore, 

M f decides the concatenation of L\ and L 2 . Since stage 2 runs in polynomial 

time and is repeated for at most 0(n) time, the algorithm runs in polynomial 
time. 


es its nondeterminism when the machines 


im 


7.8 


If we use unary encoding, the input number n has length n. An algorithm can 
determine the primality of n by checking sequentially whether n is divisible 
by any integer from 2 to n. If any of the numbers divide n, reject . Otherwise 
accept . The time to do the division is polynomial in n, and the total number 
of divisions done is at most n. Hence the algorithm runs in polynomial (in 

n) time. Therefore, UNARY-PRIMES is in P. 


7.9 


The algorithm given in page 145 runs in 0(n 3 ) time. Stage 1 takes at most 
0(n) steps to locate and mark the start node. Stage 2 causes at most n + 
1 repetitions, because each repetition except the last marks at least 
additional node. Each execution of stage 3 uses at most 0(n 3 ) steps because 
G contains at most n to be checked and for each checked node, examining 
all adjacent nodes to see whether any have been marked uses at most 0 (n 2 ) 
steps. Therefore in total, stages 2 and 3 take 0(n 4 ) time. Stage 4 uses 0(n) 
steps to scan all nodes. Therefore, the algorithm runs in 0(n 4 ) time and 

CONNECTED is in P. 


one 


7.10 


We construct a TM M that decides TRIANGLE in polynomial time. 

= “On input (G) where G is a graph: 

• • . 

1 . For each triple of vertices v \, V 2 , V 3 in G : 

If edges (vi,V 2 )> (^ 1 ,^ 3 ), and (t> 2 >V 3 )> are all edges of G , 
accept . 

3. No triangle has been found in G, so reject .” 

A graph with m vertices has (™) 

Therefore, stage 2 will be repeated at most 0(m 3 ) times. In addition, each 
stage can be implemented to rim in polynomial time. Therefore, TRIANGLE £ 


M 


2 . 


m! 


? = 0 (m 3 ) triples of vertices. 


!(m 


P. 


7.11 


A nondeterministic polynomial time algorithm for ISO operates as follows: 


“On input (G, H) where G and H 


e undirected graphs: 

1 . Let m be the number of nodes of G and H . If they don’t have 

the same number of nodes, reject . 


Sfi 


2 . Nondeterministically select a permutation 7 r of m elements. 

3. For each pair of nodes x and y of G check that (x, y) is an edge 

of G iff ( 7 r(x), 7 r(y)) is an edge of H . If all agree, accept. If any 
differ, reject. 
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Stage 2 can be implemented in polynomial time nondeterministically. Stage 
3 takes polynomial time. Therefore ISO € NP. 


The most obvious algorithm multiplies a by itself 6 times then compares the 
result to c, modulo p. That 

the length of 6. Hence this algorithm doesn’t run in polynomial time. Here 
is one that does: 

u On input (a, 6, c,p), four binary integers: 

1. Let r 4— 1. 

. Let 6i,... , bk be the bits in the binary representation of b. 

. For i 

4. If bi = 0, let rf-r 2 mod p. 

If bi = 1, let r 4- ar 2 mod p. 

6. If (c mod p) = (r mod p), accept ; otherwise reject . 

The algorithm is called repeated squaring . Each of its multiplications can 
be done in polynomial time in the standard way, because the numbers are 
no larger than p. The total number of multiplications is proportional to the 
number of bits in 6. Hence the total running time is polynomial. 


7.12 


6 — 1 multiplications which is exponential in 


* 


1 

Xj • • • ) #V• 










We show that P is closed under the star operation by dynamic programming. 

Let A £ P, and M be the TM deciding A in polynomial time. On input w = 

the following procedure ST to construct a table T such that 

Wj € A* and T[i, j] = 0 otherwise, for all 1 < i < j < n. 


7.13 




wi---w n , 

T[i, j] = 1 if tu* 




• • • 


5T = “On input w = tui • • • 

1. If w = e, accept. 


[handle w = e case] 


2. Initialize T[i, j] = 0 for 1 < i < j < n. 
. For i = 1 to n, 

4. Set T[i, i] = 1 if € A 

5. For l = 2 to n, 

6. For i = lton-H-l 


[ test each substring of length 1 ] 




[ Z is the length of the substring ] 
[ i is the substring start position ] 
[j is the substring end position] 




Let j = i +1 - 1 
If • • • ujj 6 -A, set T[i, j] = 1. 

For k = i to j — 1 

If T[t,fe] = 1 and T[fe,j] = 1, set T[i, j] = 1. 

11. Accept if T[l,n] = 1; otherwise reject . 


7. 






[ As is the split position ] 


r 




10 . 




Each stage of the algorithm takes polynomial time, and ST runs for 0(n 3 ) 
stages, so the algorithm runs in polynomial time. 


Let NTM M decide A in nondeterministic polynomial time. The following 
NTM N decides A*. 

“On input w: 


7.14 


N 
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1. If w 


e, accept. 

2. Nondeterministically choose k , where 1 < k < n. 


3. Nondeterministically select strings , W 2 ,..., Wk , each of length 

at most n. 


4. If w ^ tui • • • it;*., then reject(on this branch). 

5. Simulate (using nondeterminism) M on each 

. If M accepts each u;*, then accept ; otherwise reject. 

Each stage takes polynomial time, so the algorithm runs in polynomial time. 


Wi. 








7.15 


The reduction fails because it would result in an exponential unary instance, 
and therefore would not be a polynomial time reduction. 

We give a polynomial time algorithm for this problem using dynamic pro¬ 
gramming. The input to the algorithm is (5, t) where S is a set of numbers 
{#i,... ,Xfc}. The algorithm operates by constructing a list Li of numbers 
for 0 < i < k giving the set of possible values that a subset of the numbers 

{xi,... , x»} could sum to. Initially Lo = {0}. For each i the algorithm con¬ 
structs Li from Li_i by assigning L* 

the algorithm accepts if t E Lfc, and rejects otherwise. 


U {a + Xi\ a € Li-i}. Finally, 


Li 


i —1 


7.16 


a. Follow the marking algorithm for recognizing PA TH while additionally keep¬ 
ing track of the length of the shortest paths discovered. Here is a more 

detailed description: 

% 

u On input (G, a, 6, k) where G is a directed graph on m nodes where 

* i • 

G has nodes a and b : 

4 

1. Place a mark with label 0 on node a. 

2. For each i from 0 to m: 

Scan all the edges of G. If an edge (s, t) is found going from 
a node s marked with i to an unmarked node 6, mark node 
t with i 4-1. 

_ * 

4. If t is marked with a value of at most fc, accept. Otherwise, 

reject. 

b. First, LPATH E NP because a nondeterministic machine can guess and 

verify a simple path of length at least k from a to b. Next, UHAMPATH < 
LPATH , because the following TM F computes the reduction /. 

“On input (G, a, 6), where G is an undirected graph, and a and b 
are nodes of G. 

1. Let k be the number of nodes of G. 

2. Output (G,a,fe,fc). 

If (G, a, b) E UHAMPATH , G contains a Hamiltonian path of length k from 
to 6, thus (G, a, 6, k) E LPATH. If (G, a, 6, k) E LPATH , G contains a simple 

path of length k from a to b. Since G has k nodes, the path is Hamiltonian. 

Thus (G, a, b) E UHAMPATH. 


3. 






F 
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0 and A = E*. To show that A is 


Let A be any language in P except A 
NP-complete, we show that A € NP and that every B € NP is polynomial 

tim> reducible to A. The first condition holds because A € P so A € NP. 

To demonstrate that the second condition is true, let Xj n € A and x ou t £ -A 

be two strings that are guaranteed to exist by virtue of our assumptions 

NP implies that B is recognized by a 
polynomial time TM M. A polynomial time reduction from B to A simulates 
M to determine whether its input w is a member of B, then outputs xj n or 


7.17 


about A. The assumption that P 


x out accordingly. 


On input 0, a nondeterministic polynomial time machine can guess two 
gignmpnts and accept if both assignments satisfy <j>. Thus DOUBLE-SAT is 
in NP. We show SAT < P DOUBLE-SAT. The following TM F computes 

the polynomial time reduction /. 

“On input (4>), a Boolean formula with variables xi,X 2 ,.. 

1. Let <j>' be <f> A (x V x), where x is a new variable. 

2. Output (</>')” 

If <f> € SA T , <f>' has at least two satisfying assignments because we can obtain 
two assignments from the original assignment of 4> by changing the value of 
If <f>' e DOUBLE-SAT, <f> is also satisfiable, because x does not appear ' 
6. Therefore <t> € SAT if and only if f(4>) 6 DOUBLE-SAT. 


as 


7.19 




F 




X. 


First, note that we can compose two permutations in polynomial time. Com¬ 
posing q with itself t times requires t compositions if done directly, thereby 
giving a running time that is exponential in the length of t when t is repre¬ 
sented in binary. We can fix this problem using a common technique for fast 

exponentiation. Calculate q 1 ,? 2 ,? 4 ,? 8 ,--. 
of 2 that is smaller than t. Each term is the square of the previous term. To 
find q* we compose the previously computed permutations corresponding to 
the Is in ts binary representation. The total number of compositions is 
at most the length of t, so the algorithm runs in polynomial time. 


7.20 




where k is the largest power 


now 


HA LF- CLIQ UE € NP because a nondeterministic machine can guess and 
verify a clique with at least m/2 nodes in polynomial time. We show that 
CLIQUE <p HALF-CLIQUE. The following TM F computes the reduction 


7.21 


/• 


“On input ( G , k) where G is an undirected graph with m nodes 
and k is an integer. 

1. If k = m/2 output { G). 

2. If k < m/2, construct G' by adding a complete graph with 

2k nodes and connecting them to all nodes in G. Output 


F 




IG'). 
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.If k > m/2, construct G" by adding 2k 
Output (G"). 

When k < m/2, if G has a fc-clique, G ' has a clique of size k + (m - 2Jk) = 
(2m - 2*)/2, thus G' € HALF-CLIQUE . If G' has a & + (m - 2fc)-clique, 
then at most m — 2A: of them come from the new nodes, thus G must contain 
a fc-clique. When k > m/2, if G has a fc-clique, G" has a clique of size 

(m 4- 2k - m)/2 = ife, thus G" € HALF-CLIQUE . If G" € HALF-CLIQUE , 

s 

G must contain a ^-clique because the half-clique cannot include any of the 
new isolated nodes. 




m isolated nodes. 




7.22 a. In 


/-assignment each clause has at least one literal assigned 1 and at 
least one literal assigned 0. The negation of an /-assignment preserves this 
property, so it too is an /-assignment. 

b. To prove that the given reduction works, we need to show that if the formula 

<j) is mapped to <\>\ then <f> is satisfiable (in the ordinary sense) iff <p r has an 
/-assignment. First, if <f) is satisfiable, we can obtain an /-assignment for <p ' 
by extending the assignment to <j> so that we assign Z{ to 1 if both literals y\ 
and j /2 in clause c» of <j> are assigned 0. Otherwise we assign Z{ to 0. Finally, 
we assign b to 0. Second, if $ has an /-assignment we can find a satisfying 
assignment to ^ as follows. By part (a) we may assume the /-assignment 
assigns 6 to 0 (otherwise, negate the assignment). This assignment cannot 
assign all of yi, j/ 2 , and y$ to 0, because doing so would force one of the clauses 
in </>' to have all 0s. Hence, restricting this assignment to the variables of <j> 
gives a satisfying assignment. 

c. Clearly, / SAT is in NP. Hence, it is NP-complete because 3SAT reduces 

to it. 




First, MAX-CUT 6 NP because a nondeterministic algorithm can guess the 
cut and check that it has size at least k in polynomial time. 

Second, we show / 3SAT <p MAX-CUT . Use the reduction suggested in 
the hint. For clarity, though, use c instead of k for the number of clauses. 
Hence, given an instance <\> of / 3SAT with v variables and c clauses. We 
build a graph G with 6ct; nodes. Each variable corresponds to 6c nodes; 
3c labeled X{ and 3c labeled xl. Connect each Xi node with each xl node 
for a total of 9c 2 edges. For each clause in </> select three nodes labeled by 
the literal in that clause and connect them by edges. Avoid selecting the 
same node more than once (each label has 3c copies, so we cannot run out 
of nodes). Let k be 9c 2 v -f 2c. Output (G, k). Next we show this reduction 
works. 

We show that <f> has a /-assignment iff G has a cut of size at least k. Take an 
/-assignment for <fi. It yields a cut of size k by placing all nodes corresponding 
to true variables on one side and all other nodes on the other side. Then the 
cut contains all 9c 2 v edges between literals and their negations and two edges 
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for each clause, because it contains at least one true and one false literal, 
yielding a total of 9 c 2 v 4 * 2 c = k edges. 

For the converse, take a cut of size k. Observe that a cut can contain at most 
two edges for each clause, because at least two of the corresponding literals 
must be on the same side. Thus, the clause edges can contribute at most 
2 c to the cut. The graph G has only 9 c 2 v other edges, so if G has a cut of 

i 

size Jb, all of those other edges appear in it and each clause contributes its 

. Hence, all nodes labeled by the same literal occur on the same 
side of the cut. By selecting either side of the cut and assigning its literals 
true, we obtain an assignment. Because each clause contributes two edges to 
the cut, it must have nodes appearing on both sides and so it has at least 
one true and one false literal. Hence (f) has a ^-assignment* 

(A minor glitch arises in the above reduction if 0 has a clause that contains 
both x and x for some variable x. In that case G would need to be a multi¬ 
graph, because it would contain two edges joining certain nodes. We can 
avoid this situation by observing that such clauses can be removed from <f> 
without changing its ^-satisfiability.) 


fiiT;V4| liMkiii 


To show that all problems in NP are polynomial time reducible to D we 
show that 3SAT <p D. Given a 3 cnf formula <t> we construct a polynomial 
p with the same variables. We represent the variable x in <f> by x in p and 
its negation x by 1 — x. Next we represent the Boolean operations A and V 

• I _ 

in <f> by arithmetic operations that simulate them, as follows. The Boolean 
expression y\ A yi becomes yijfa and Vi V 2/2 becomes (1 — (1 — l/i)(l — 2 / 2 ))- 
Hence, the clause (yi V y<x V y$) becomes (1 - (1 - yi)(l — 2 / 2 )(l - 2 /s))* 
The conjunction of the clauses of 0 becomes a product of their individual 

representations. The result of this transformation is a polynomial, < 7 , which 
simulates <j> in the sense they both have the same value when the variables 
have Boolean assignments. Thus the polynomial 1 — q has an integral (in 
fact Boolean) root if <j) is satisfiable. However, 1 — q does not meet the 
requirements of p, because 1 — q may have an integral (but non-Boolean) 
root even when <f> is unsatisfiable. To prevent that situation, we observe 

that the polynomial r = (xi(l - Xi)) 2 (x 2 (l — X2)) 2 • • • (x m (l — x m )) 2 is 0 
only when the variables xl,... , x\ take on Boolean values, and is positive 

otherwise. Hence polynomial p = (1 — q) 2 -f r has an integral root iff <j> is 
satisfiable. 


7.24 


U is in NP because on input (d,x, 1*) a nondeterministic algorithm can sim¬ 
ulate Md on x (making nondeterministic branches when Md does) for t steps 
and accept if Md accepts. Doing so takes time polynomial in the length of 
the input because t appears in unary. 

To show 3SAT <p i7, let M be a NTM that decides 3SAT in time cn 

for some constants c and k. Given a formula 0, transform it into w = 
{(Af),<£, l c M fc ). By the definition of M, w G U if and only if <p £ 3SAT. 


7.25 
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7.26 


PUZZLE is in NP because we can guess a solution and verify it in polynomial 
time. We reduce 3SAT to PUZZLE in polynomial time. Given a Boolean 
formula 0, convert it to a set of cards as follows. 

Let * * 

Say that z G Cj if z is one of the literals i 
1 and column 2 with l possible holes in each column numbered 1 through l. 
When placing a card in the box, say that a card is face up if column 1 is on 
the left and column 2 is on the right; otherwise face down . The following 
algorithm F computes the reduction. 

“On input <t>: 

# 

1. Create one card for each X{ as follows: in column 1 punch out 

all holes except any hole j such that x, G Cj\ in column 2 punch 
out all holes except any hole j' such that xl G Cj 

2. Create 

and no hole in column 2 punched out. 

. Output the description of the cards created. 

Given a satisfying assignment for <f> } a solution to the PUZZLE can be con¬ 
structed as follows. •For each assigned True (False), put its card face up 
(down), and put the extra card face up. Then, every hole in column 1 is 
covered because the associated clause has a literal assigned True, and every 

hole in column 2 is covered by the extra card. 

/ 

Given a solution to the PUZZLE , a satisfying assignment for cf> can be con¬ 
structed as follows. Flip the deck so that the extra card covers column 2. 
Assign the variables according to the correponding cards, True for up and 
False for down. Because every hole in column 1 is covered, every clause must 
contain at least one literal assigned True in this assignment. Hence it satisfies 


be the variables of <j> and let ci, C 2 ,... , ci be the clauses. 

Let each card have column 


• > &m 




III! 


m 


F 


/ . 


extra card with all holes in column 1 punched out 


^ 11 


* 




<f>. 


SET-SPLITTING in in NP because a coloring of the elements can be guesses 
and verified to have the desired properties in polynomial time. We give a 
polynomial time reduction / from 3SAT to SET-SPLITTING as follows. 

y}. In other words, S con¬ 
tains an element for each literal (two for each variable) and a special element 
y. For every clause c* in <f> } let Ci be a subset of S containing the ele¬ 
ments corresponding to the literals in c* and the special element y G 5. We 
also add subsets C Zi for each literal, containing elements and x7. Then 

C = {Ci,... , Ci , Cxi )••• >}. 

If ^ is satisfiable, consider a satisfying assignment. If we color all the true 
literals red, all the false ones blue, and y blue, then every subset Ci of S has 
at least one red element (because Ci of 0 is “turned on” by some literal) and it 
also contains one blue element (y). In addition, every subset C Xi has exactly 
one element red and one blue, so that the system (£, C) G SET-SPLITTING . 

If (5, C) G SET-SPLITTING , then look at the coloring for 5. If we set the 
literals to true which are colored differently from y, and those to false which 
are the same color as y we obtain a satisfying assignment to <f>. 


7.27 


Given a 3cnf <f>> we set S = {xi,xT,.. 
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e binary integers and a = pq for b < p < c}. We 
see that F € NP because a nondeterministic algorithm can guess p and check 
that p divides a and that b < p < c in polynomial time. If P = NP then 
jF e P. In that case, we can locate a factor of a by successively running F 
on smaller and smaller intervals 6,c, in effect performing a binary search on 
the interval 1 to a. The running time of that procedure is the log of the size 
of the interval, and thus is polynomial in the length of a. 


{(a,6,c)| a,b,c 


Let F 


7.28 


Assuming P = NP, we have a polynomial time algorithm deciding SAT . To 
produce a satisfying assignment for a satisfiable formula <p> substitute x\ = 0 
and xi = 1 in <p and test the satisfiability of the two resulting formulas <po 
and <p i. At least one of these must be satisfiable because <p is satisfiable. 
At least one of these must be satisfiable because <p is satisfiable. If (po is 
satisfiable, pick x\ = 0; otherwise (pi must be satisfiable and pick x\ = 1. 
That gives the value of xi in a satisfying assignment. Make that substitution 
permanent and determine a value for x^ in that satisfying assignment in the 
same way. Continue until all variables been substituted. 


7.29 


NP, then CLIQUE is recognizable in polynomial time. We show 

4 

how-to compute MAX-CLIQUE using CLIQUE . Let m be the number of 

* 

nodes in the input graph. For each i from 1 to m, test whether there exists 

F 

a clique of size i using the polynomial time algorithm for CLIQUE. Output 

• • . • 

the largest such i for which a clique exists. 

To find the 


If P 


7.30 


clique, we start with i, the maximum clique size. 
Remove one node and see if there is still a clique of size i . If not, restore that 
node and remove another. If so, iterate the process until we are left with a 
graph of i nodes, which must be a clique. This process takes at most m trials 
to find which node to remove, and at most m nodes need to be removed. The 
total running time is therefore polynomial. 


in>:V4iM\ili«i 


Test whether a path exists from the start state to each non-accepting state. 

4 

[this problem is too easy—should have been only an exercise] 


7.31 


If we used 2x2 windows, we might not detect certain invalid computation 
histories where multiple heads occurred in the same row. For example, if 
the NTM had the nondeterministic choice of moving its head left or right 
when in a certain state reading some symbol, the next configuration must 
have one of these possibilities, but not both simultaneously. However, using 
a 2 x 2 window, a row that contained two state symbols corresponding to 
each possible next move would not appear incorrect, because each window 
would itself be legal. 

Furthermore, once two heads appeared in a single row, they could collaborate 
to make the final row accepting, even though no actual accepting computa¬ 
tion existed. 


7.32 
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The given algorithm does use exponential time, but a different, polynomial 
time algorithm could conceivably exist. Hence, SAT might have polynomial 
time complexity. The error in the proof occurs in the sentence, “Thus SAT 
has exponential time complexity” because that conclusion doesn’t follow log¬ 
ically from the previous statements. 


7.33 


3COLOR is in NP because a coloring can be verified in polynomial time. We 
show 3SAT <p 3COLOR . Let <t> 
variables xi,£ 2 > * * * 

G# containing 2m + 6/4-3 nodes: 2 nodes for each variable; 6 nodes for each 
clause; and 3 extra nodes. We describe Q# in terms of the subgraph gadgets 
given in the Hint. 


7.34 


ci A C2 A • • • A c\ be a 3cnf formula over 
where the c*’s are the clauses. We build a graph 






G# contains a variable gadget for each variable X{, two OR-gadgets for each 

clause, and one palette gadget. The four bottom nodes of the OR-gadgets 

■ 

will be merged with other nodes in the graph, as follows. Label the nodes 
of the palette gadget T, F, and R. Label the nodes in each variable gadget 
4- and 


and connect each to the R node in the palette gadget. In each 
clause, connect the top of one of the OR-gadgets to the F node in the palette. 
Merge the bottom node of that OR-gadget with the top node of the other OR- 
gadget. Merge the three remaining bottom nodes of the two OR-gadgets with 
corresponding nodes in the variable gadgets so that if a clause contains the 
literal Xi , one of its bottom nodes is merged with the 4 node of whereas 
if the clause contains the literal x7, one of its bottom nodes is merged with 

node of X{. 


the 


To show the construction is correct, we first demonstrate that if <f> is satisfi- 

f 

able, the graph is 3-colorable. The three colors are called T, F, and R. Color 
the palette with its labels. For each variable, color the 4- node T and the — 

s 

node F if the variable is True in a satisfying assignment; otherwise reverse 
the colors. Because each clause has one True literal in the assignment, we 
can color the nodes of the OR-gadgets of that clause so that the node con¬ 
nected to the F node in the palette is not colored F. Hence we have a proper 

3-coloring. 

If we start out with a 3-coloring, we can obtain a satisfying assignment by 
taking the colors assigned to the 4- nodes of each variable. Observe that 
neither node of the variable gadget can be colored R, because all variable 
nodes are connected to the R node in the palette. Furthermore, if both 
bottom nodes of an OR-gadget are colored F, the top node must be colored 
F, and hence, each clause contain a true literal. Otherwise, the three bottom 
nodes that were merged with variable nodes would be colored F, and then 
both top nodes would be colored F, but one of the top nodes is connected to 
the F node in the palette. 


We build an NFA in polynomial time that accepts all non-satisfying assign¬ 
ments. The NFA operates by guessing a clause is not satisfied by the as¬ 
signment and then reading the input to check that the clause is indeed not 


7.36 


Theory of Computation 


56 


satisfied. More precisely, the NFA contains l(m + 1) + 1 states, where l is 
the number of clauses and m is the number of variables in <f >. Each clause Cj 
is represented by m + 1 states i,... , 9j, m +i* The start state go branches 
nondeterministically to each state qij on e. Each state qij branches to qij+i 
for j < m. The label on that transition is 0 if X, appears in clause Cj , and is 1 
if xl appears in Cj. If neither literals appear in Cj, the label on the transition 

is 0,1. Each state gi, m +i is &n accept state. 

0 4 

If we could minimize tlfis NFA in polynomial time, we would be able to 
determine whether it accepts all strings of length m, and hence whether 0 is 
satisfiable, in polynomial time. That would imply P = NP. 


The clause (x V y) is logically equivalent to each of the expressions (x —► y) 
and (y -* x). We represent the 2cnf formula <f> on the variables xi,... , x m by 

a directed graph G on 2m nodes labeled with the literals over these variables. 

% 

For each clause in 0, place two edges in the graph corresponding to the the 

two implications above. Then we show that (f> is satisfiable iff G doesn’t 

• •••••% » 

contain a cycle containing both x* and xl for any i. We’ll call such a cycle 

inconsistency cycle . Testing whether G contains an inconsistency cycle 
is easily done in polynomial time with a marking algorithm, or a depth-first 

i 

search algorithm. 

We first show that if G contains an inconsistency cycle, no satisfying as- 

• i i • 

signment can exist. The sequences of implications in the inconsistency cycle 
yields the logical equivalence X* <4 xl for some i, and that is contradictory. 
Thus <f> is unsatisfiable. 

Next, we show that if G doesn’t contain an inconsistency cycle, <f> is satis- 
fiable. Write x -> y if G contains a path from node x to node y. Because 
G contains the two designated edges for each clause in <f> } we have x A y iff 
y A x. Now we construct the satisfying assignment. 

• * *. •• 11'*^ * ^ • 

4l 4l nj 

Pick any variable x*. We cannot have both X* -* xl and xl —► x* because G 
doesn’t contain an inconsistency cycle. Select literal x< if xl A x< is false, and 
otherwise select xl- Assign the selected literal and all implied literals (those 

reachable along paths from the selected node) to be True. Note that we never 

■ ^ ^ ^ 
assign both Xj and xj True because if x* -* Xj and Xi xj then Xj —> xl and 

hence x< A xl thus we would not have selected literal x< (similarly for xl). 
Then, we remove all nodes labeled with assigned literals or their complements 
from G, and repeat this paragraph until all variables axe assigned. 
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The resulting assignment satisfies <j> because it satisfies every clause. As soon 
one of the literals in a clause is assigned False, the other must be assigned 

ow from the negation of the falsified literal 


w 


True because G contains 


►:iii 


to that other literal. 
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